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(57) Abstract: This invention relates to a method to promote the differentiation of stem cells, typically embryonic stem cells, through 
the use of RNA interference, by the introduction of stem loop RNA into a cell. 
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Method for Modulating Stem Cell Differentiation Using Stem Loop RNA 

The invention relates to a method to modulate stem cell differentiation comprising 
introducing stem loop containing RNA into a stem cell to ablate mRNA's which 
5 encode polypeptides which are involved in stem cell differentiation; stem loop 
RNA's ; and nucleic acid molecules and vectors encoding stem loop RNA's. 

A number of techniques have been developed in recent years which purport to 
specifically ablate genes and/or gene products. For example, the use of anti-sense 

10 nucleic acid molecules to bind to and thereby block or inactivate target mRNA 
molecules is an effective means to inhibit the production of gene products. This is 
typically very effective in plants where anti-sense technology produces a number of 
striking phenotypic characteristics. However, antisense is variable leading to the 
need to screen many, sometimes hundreds of, transgenic organisms carrying one or 

15 more copies of an antisense transgene to ensure that the phenotype is indeed truly 
linked to the antisense transgene expression. Antisense techniques, not necessarily 
involving the production of stable transfectants, have been applied to cells in culture, 
with variable results. 

20 In addition, the ability to be able to disrupt genes via homologous recombination has 
provided biologists with a crucial tool in defining developmental pathways in higher 
organisms. The use of mouse gene "knock out" strains has allowed the dissection of 
gene function and the probable function of human homologues to the deleted mouse 
genes, (Jordan andZant, 1998). 

25 

A much more recent technique to specifically ablate gene function is through the 
introduction of double stranded RNA, also referred to as inhibitory RNA (RNAi), 
into a cell which results in the destruction of mRNA complementary to the sequence 
included in the RNAi molecule. The RNAi molecule comprises two complementary 
30 strands of RNA (a sense strand and an antisense strand) annealed to each other to 
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form a double stranded RNA molecule. The RNAi molecule is typically derived 
from exonic or coding sequence of the gene which is to be ablated. 

Surprisingly, only a few molecules of RNAi are required to block gene expression 
5 which implies the mechanism is catalytic. The site of action appears to be nuclear as 
little if any RNAi is detectable in the cytoplasm of cells indicating that RNAi exerts 
its effect during mRNA synthesis or processing. 

The exact mechanism of RNAi action is unknown although there are theories to 
10 explain this phenomenon. For example, all organisms have evolved protective 
mechanisms to limit the effects of exogenous gene expression. For example, a virus 
often causes deleterious effects on the organism it infects. Viral gene expression 
and/or replicationtherefore needs to be repressed. In addition, the rapid development 
of genetic transformation and the provision of transgenic plants and animals has led 
15 to the realisation that transgenes are also recognised as foreign nucleic acid and 
subjected to phenomena variously called quelling (Singer and Selker, 1995), gene 
silencing (Matzke and Matzke, 1998) , and co-suppression (Stam et. al., 2000). 

Initial studies using RNAi used the nematode Caenorhabditis elegans. RNAi 
20 injected into the worm resulted in the disappearance of polypeptides corresponding to 
the gene sequences comprising the RNAi molecule(Montgomery et. al., 1998; Fire et. 
al., 1998). More recently the phenomenon of RNAi inhibition has been shown in a 
number of eukaryotes including, by example and not by way of limitation, plants, 
trypanosomes (Shi et. al., 2000) Drosophila spp* (Kennerdell and Carthew, 2000). 
25 Recent experiments have shown that RNAi may also function in higher eukaryotes. 
For example, it has been shown that RNAi can ablate c-mos in a mouse ooctye and 
also E-cadherin in a mouse preimplanation embryo (Wianny and Zernicka-Goetz, 
2000). 

30 The use of RNAi to ablate stem cell RNA is disclosed in our co-pending application, 
WO 02/16620, which is incorporated by reference. 
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During mammalian development those cells that form part of the embryo up until the 
formation of the blastocyst are said to be totipotent (e.g. each cell has the 
developmental potential to form a complete embryo and all the cells required to 
5 support the growth and development of said embryo). During the formation of the 
blastocyst, the cells that comprise the inner cell mass are said to be pluripotential 
(e.g. each cell has the developmental potential to form a variety of tissues). 

Embryonic stem cells (ES cells, those with pluripotentiality) may be principally 
10 derived from two embryonic sources. Cells isolated from the inner cell mass are 
termed embryonic stem (ES) cells. In the laboratory mouse, similar cells can be 
derived from the culture of primordial germ cells isolated from the mesenteries or 
genital ridges of days 8.5-12.5 post coitum embryos. These would ultimately 
differentiate into germ cells and are referred to as embryonic germ cells (EG cells). 
15 Each of these types of pluripotential cell has a similar developmental potential with 
respect to differentiation into alternate cell types, but possible differences in 
behaviour (eg with respect to imprinting) have led to these cells to be distinguished 
from one another . 



20 Typically ES/EG cell cultures have well defined characteristics. These include, but 
are not limited to; 



i) maintenance in culture for at least 20 passages when maintained on fibroblast 
feeder layers; 

25 ii) produce clusters of cells in culture referred to as embryoid bodies; 

iii) ability to differentiate into multiple cell types in monolayer culture; 

iv) can form embryo chimeras when mixed with an embryo host; 

v) express ES/EG cell specific markers. 



30 Until very recently, in vitro culture of human ES/EG cells was not possible. The first 
indication that conditions may be determined which could allow the establishment of 
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human ES/EG cells in culture is described in W096/22362. The application 
describes cell lines and growth conditions which allow the continuous proliferation 
of primate ES cells which exhibit a range of characteristics or markers which are 
associated with stem cells having pluripotent characteristics. 

5 

More recently Thomson et al (1998) have published conditions in which human ES 
cells can be established in culture. The above characteristics shown by primate ES 
cells are also shown by the human ES cell lines. In addition the human cell lines 
show high levels of telomerase activity, a characteristic of cells which have the 

10 ability to divide continuously in culture in an undifferentiated state. Another group 
(Reubinoff et. al., 2000) have also reported the derivation of human ES cells from 
human blastocyts. Shamblott et al, 1998 have also described EG cell derivation. In 
Lake et al J Cell Science 2000, 113:555-66 and Rathjen et al J Cell Science 1999, 
1 12: 601-12, ectodermal stem cells are disclosed. The above references are each both 

1 5 incorporated by reference in their entirety. 

A feature of ES/EG cells is that, in the presence of fibroblast feeder layers, they 
retain the ability to divide in an undifferentiated state for several generations. If the 
feeder layers are removed then the cells differentiate. The differentiation is often to 
20 neurones or muscle cells but the exact mechanism by which this occurs and its 
control remain unsolved. 

In addition to ES/EG cells a number of adult tissues contain cells with stem cell 
characteristics. Typically these cells, although retaining the ability to differentiate 

25 into different cell types, do not have the pluripotential characteristics of ES/EG cells. 
For example haemopoietic stem cells have the potential to form all the cells of the 
haemopoietic system (red blood cells, macrophages, basophils, eosinophils etc). All 
of nerve tissue, skin and muscle retain pools of cells with stem cell potential. 
Therefore, in addition to the use of embryonic stem cells in developmental biology, 

30 there are also adult stem cells which may also have utility with respect to determining 
the factors which govern cell differentiation. . Further recent studies have suggested 
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that some stem cells previously thought to be committed to a single fate, (e.g 
neurons) may indeed possess considerable pluripotentcy in certain situations. Neural 
stem cells have recently been shown to chimerise a mouse embryo and form a wide 
range of non-neural tissue (Clark et al., 2000). 

5 

A further group of cells which have relevance to developmental biology are 
pluripotent embryonal carcinoma cells (EC cells) which are stem cells of 
teratocarcinomas, also referred to as teratomas, which are able to differentiate into all 
cell types found in these tumours. A teratocarcinoma also includes teratocarcinoma 
10 cells which do not have the full pluripotential characteristics of an EC cell but 
nevertheless can differentiate into a restricted number of differentiated tissues. These 
cells have many features in common with ES/EG cells. The most important of these 
features is the characteristic of pluripotentiality. 

15 Teratomas contain a wide range of differentiated tissues, and have been known in 
humans for many hundreds of years. They typically occur as gonadal tumours of 
both men and women. The gonadal forms of these tumours are generally believed to 
originate from germ cells, and the extra gonadal forms, which typically have the 
same range of tissues, are thought to arise from germ cells that have migrated 

20 incorrectly during embryogenesis. Teratomas are therefore generally classed as germ 
cell tumours which encompasses a number of different types of cancer. These include 
seminoma, embryonal carcinoma, yolk sac carcinoma and choriocarcinoma. 

The similar biology of EC cells with ES/EG cells has been exploited to study the 
25 developmental fates of cells and to identify cell markers commonly expressed in EC 
cells and ES/EG cells. For example, and not by way of limitation, the expression of 
specific cell surface markers SSEA-3 (+), SSEA-4 (+), TRA-1-60 (+), TRA-1-81 (+) 
(Shevinsky et al 1982; Kannagi et al 1983; Andrews et al 1984a; Thomson et al 
1995); alkaline phosphatase (+) (Andrews et. al, 1996); and Oct 4 (Scholer et. al., 
30 1989; Kraft et. al, 1996; Reubinoff et. al., 2000; Yeom et. al, 1996). 
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We have accumulated expression studies which identify a number of genes thought 
to be involved in determining the developmental fate of stem cells, particularly 
embryonic stem cells. By northern blotting we have identified the expression of 
human homologs of two signalling pathways believed to be critical in cell fate 
5 determination. Expression of ligands, receptors and downstream components of the 
Notch and Wingless signalling cascades have been elucidated. Using the model 
system NTERA2/D1 embryonal carcinoma cells we have recorded changes in the 
expression of some of these components as the cells differentiate. Bearing in mind 
the role these cascades play in embryonic development throughout the animal 

10 kingdom, these changes suggest a significant role for both the wingless and Notch 
signalling pathways in differentiation of stem cells. Furthermore the activity of some 
genes are required for differentiation to occur along specific pathways e.g. the 
myogenic gene MyoDl. Other genes have activity which inhibits cellular 
differentiation along particular pathways. We envisage regulation of stem cell 

1 5 differentiation to yield a specific cell type could be achieved by: 



(i) inhibition of certain genes that normally promote differentiation along 
particular pathways; therefore promoting differentiation to alternate cell 
phenotypes; 

20 (ii) inhibition of gene activity that prevents differentiation into particular cell 
types; and 

(iii) a combination of (i) and (ii), see figure 1 



25 In our co-pending application, WO02/16620, we introduce RNAi molecules 
homologous to genes encoding factors involved in stem cell differentiation. The 
differentiation of stem cells during embryogenesis, during tissue renewal in the adult 
and wound repair is under very stringent regulation; aberrations in this regulation 
underlie the formation of birth defects during development and are thought to 

30 underlie cancer formation in adults. 

6 
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Generally, it is envisaged that stem cells are under both positive and negative 
regulation which allows a fine degree of control over the process of cell proliferation 
and cell differentiation: excess proliferation at the expense of cell differentiation can 
lead to the formation of an expanding mass of tissue - a cancer - whereas express 
5 differentiation at the expense of proliferation can lead to the loss of stem cells and 
production of too little differentiated tissue in the long term, and especially the loss 
of regenerative potential. Certain genes have already been identified to have a 
negative role in preventing stem cell differentiation. Such genes, like those of the 
Notch family, when mutated to acquire activity can inhibit differentiation; such 
10 mutant genes act as oncogenes. On the contrary, loss of function of such genes on 
their inhibition results in stem cell differentiation. 

We propose to use EC cells has a model cell system to follow the effects of 
perturbations in stem cell differentiation. We further propose an alternative approach 
15 to introduce double stranded RNA molecules into stem cells to ablate mRNA's. 

The invention relates to the provision of stem-loop RNA structures which can either 
be synthesised in vitro followed by transfection into a stem cell, or alternatively, 
synthesised in vivo by the stem cell from vectors which are provided with expression 
20 cassettes which include a DNA molecule which includes the coding sequence for the 
stem-loop RNA. 

The DNA molecule encoding the stem-loop RNA is constructed in two parts, a first 
part which is derived from a gene the regulation of which is desired. The second part 

25 is provided with a DNA sequence which is complementary to the sequence of the 
first part. The cassette is typically under the control of a promoter which transcribes 
the DNA into RNA The complementary nature of the first and second parts of the 
RNA molecule results in base pairing over at least part of the length of the RNA 
molecule to form a double stranded hairpin RNA structure or stem-loop. The first 

30 and second parts can be provided with a linker sequence. 
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According to a first aspect of the invention there is provided a method to modulate 
the differentiation state of a stem cell comprising: 

(i) contacting a stem cell with at least one nucleic acid molecule comprising a 
5 sequence of a gene which mediates at least one step in the differentiation of said cell 
which nucleic acid molecule consists of a first part linked to a second part wherein 
said first and second parts are complementary over at least part of their length and 
further wherein said first and second parts form a double stranded region by 
complementary base pairing over at least part of their length; 
10 (ii) providing conditions conducive to the growth and differentiation of the cell 
treated in (i) above; and optionally 

(iii) maintaining and/or storing the cell in a differentiated state. 

In a preferred method of the invention said first and second parts are linked by at 
1 5 least one nucleotide base. 

The provision of first and second sequences which are complementary to one another 
and which comprise at least part of the coding sequence of a gene involved in stem 
cell differentiation means that when the sequence is transcribed into RNA the 

20 complementarity between first and second sequences allows base pairing between 
first and second sequences to form a double stranded RNA structure, see Figure 1. 
The optional provision of a linking region bewteen first and second parts results in 
the formation of a so called "hair-pin" loop structure. The transcription of the 
nucleic acid provides many copies of the hair-pin loop RNA which effectively 

25 functions as a RNAi molecule. 

In a preferred method of the invention said nucleic acid molecule is a stem loop RNA 
molecule. Alternatively, said nucleic acid molecule is a DNA molecule which 
encodes said stem loop RNA. Ideally said DNA molecule is a vector adapted for 
30 expression of said stem loop RNA. 

8 



WO 03/012082 



PCT/GB02/03409 



The stem cell in (i) above may be a teratocarcinoma cell. 

In a preferred method of the invention said conditions are in vitro cell culture 
conditions. 

5 

In a further preferred method of the invention said stem cell is selected from: 
pluripotent stem cells such as embryonic stem cell; embryonic germ cell and 
embryonal carcinoma cells; and lineage restricted stem cells such as, but not 
restricted to; haemopoietic stem cell; muscle stem cell; nerve stem cell; skin dermal 
1 0 sheath stem cell; liver stem cell; and teratocarcinoma cells. 

It will be apparent that the method can provide stem cells of intermediate 
commitment. For example, embryonic stem cells could be programmed to 
differentiate into haemopoietic stems cells with a restricted commitment. 
15 Alternatively, differentiated cells or stem cells of intermediate commitment could be 
reprogrammed to a more pluripotential state from which other differentiated cell 
lineages can be derived. 

In a further preferred method of the invention said stem cell is an embryonic stem 
20 cell or embryonic germ cell. 

In a yet further preferred method of the invention said stem loop RNA molecule is 
derived from a gene which encodes a cell surface receptor expressed by a stem cell. 

25 In a further preferred method of the invention said cell surface receptor is selected 
from: human Notch l(hNotch 1); hNotch 2; hNotch 3; hNotch 4; TLE-1; TLE-2; 
TLE-3; TLE-4; TCF7; TCF7L1; TCFFL2; TCF3; TCF19; TCF1; mFringe; lFringe; 
rFringe; sel 1; Numb; Numblike; LNX; FZD1; FZD2; FZD3; FZD4; FZD5; FZD6; 
FZD7; FZD8; FZD9; FZD10; FRZB. 

30 
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In an alternative preferred method of the invention said stem loop RNA molecule is 
derived from a gene which encodes a ligand. 

Typically, a ligand is a polypeptide which binds to a cognate receptor to induce or 
5 inhibit an intracellular or intercellular response. Ligands may be soluble or 
membrane bound. 

In a further alternative preferred method of the invention said ligand is selected from: 
DIM; D113; D114; Dlk-1; Jagged 1; Jagged 2; Wnt 1; Wnt 2; Wnt 2b; Wnt 3; Wnt 
10 3a; Wnt5a; Wnt6; Wnt7a; Wnt7b; Wnt8a; Wnt8b; WntlOb; Wntll; Wntl4; WntlS. 

Alternatively, said gene is selected from: SFRP1; SFRP2; SFRP4; SFRP5; SK; 
DKK3; CER1; W-l; DVL1; DVL2; DVL3; DVLlLl;mFringe; IFringe; rFringe; 
selll; Numb; LNX Oct4;NeuroDl; NeuroD2; NeuroD3; Brachyury; MDFI. 

15 

In a further preferred method of the invention said stem loop RNA molecule is 
derived from at least one of the sequences identified in Table 4 or Figures 4-54. 

In a yet futher preferred embodiment of the invention said sequence is derived from 
20 Oct 4. Preferably the Oct 4 sequence corresponds to nucleotide sequence about 610 
to about 1032 of the Oct 4 sequence found in GenBank accession number NM_ 
002701. 

Many methods have been developed over the last 30 years to facilitate the 
25 introduction of nucleic acid into cells which are well known in the art and are 
applicable to the stem loop RNA structures disclosed herein or the vectors which 
encode said stem loop structures. 

Methods to introduce nucleic acid into cells typically involve the use of chemical 
30 reagents, cationic lipids or physical methods. Chemical methods which facilitate the 
uptake of DNA by cells include the use of DEAE -Dextran ( Vaheri and Pagano 
Science 175: p434) . DEAE-dextran is a negatively charged cation which associates 
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and introduces the nucleic acid into cells. Calcium phosphate is also a commonly 
used chemical agent which when co-precipitated with nucleic acid introduces the 
nucleic acid into cells (Graham et al Virology (1973) 52: p456). 

5 The use of cationic lipids (eg liposomes ( Feigner (1987) Proc.Natl.Acad.Sci USA, 
84;p7413) has become a common method. The cationic head of the lipid associates 
with the negatively charged nucleic acid backbone to be introduced. The lipid/nucleic 
acid complex associates with the cell membrane and fuses with the cell to introduce 
the associated nucleic acid into the cell. Liposome mediated nucleic acid transfer has 
10 several advantages over existing methods. For example, cells which are recalcitrant 
to traditional chemical methods are more easily transfected using liposome mediated 
transfer. 

More recently still, physical methods to introduce nucleic acid have become effective 
15 means to reproducibly transfect cells. Direct microinjection is one such method 
which can deliver nucleic acid directly to the nucleus of a cell ( Capecchi (1980) 
Cell, 22:p479). This allows the analysis of single cell transfectants. So called 
"biolistic" methods physically shoot nucleic acid into cells and/or organelles using a 
particle gun ( Neumann (1982) EMBO J, 1: p841). Electroporation is arguably the 
20 most popular method to transfect nucleic acid. The method involves the use of a 
high voltage electrical charge to momentarily permeabilise cell membranes making 
them permeable to macromolecular complexes. 

More recently still a method termed immunoporation has become a recognised 
25 techinque for the introduction of nucleic acid into cells, see Bildirici et al Nature 
(2000) 405, p298. The technique involves the use of beads coated with an antibody 
to a specific receptor. The transfection mixture includes nucleic acid, antibody coated 
beads and cells expressing a specific cell surface receptor. The coated beads bind the 
cell surface receptor and when a shear force is applied to the cells the beads are 
30 stripped from the cell surface. During bead removal a transient hole is created 
through which nucleic acid and/or other biological molecules can enter. Transfection 
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efficiency of between 40-50% is achievable depending on the nucleic acid used. In 
addition the specificity of cell delivery of RNAi's can be enhanced by association or 
linkage of the RNAi to specific antibodies, ligands or receptors. 

5 There are also a number of commercially available transfection kits which purport to 
provide high efficiency transfection of cells. A kit which is particularly preferred is 
sold under the tradename ExGen 500^ by MBI Fermentas, Lithuania. ExGen is a 
polyethylenimine, non-liposomal transfection reagent. 

10 According to a further aspect of the invention there is provided a stem loop RNA 
molecule derived from a coding sequence of at least one gene involved in stem cell 
differentiation comprising a first part linked to a second part wherein said first and 
second parts are complementary over at least part of their length and further wherein, 
said first and second parts form a double stranded region by complementary base 

15 pairing over at least part of their length. 

In a preferred embodiment of the invention said first and second parts are linked by at 
least one nucleotide base. In a further preferred embodiment of the invention said 
first and second parts are linked by 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotide bases. In a 
20 yet further preferred embodiment of the invention said linker is at least 10 nucleotide 
bases. 

In a preferred embodiment said coding sequence is an exon. 

25 Alternatively said RNA molecule is derived from intronic sequences or the 5* and/or 
y non-coding sequences which flank coding/exon sequences of genes which mediate 
stem cell differentiation. 

In a further preferred embodiment of the invention the length of the RNA molecule is 
30 between 10 nucleotide bases (nb) -lOOOnb. More preferably still the length of the 
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RNA molecule is selected from lOnb; 20nb; 30nb; 40nb; 50nb; 60nb; 70nb; 80nb; 
90nb. More preferably still said RNA molecule is 21nb in length. 

In a further preferred embodiment of the invention said RNA molecule is lOOnb; 
5 200nb; 300nb; 400nb; 500nb; 600nb; 700nb; 800nb; 900nb; or lOOOnb. More 
preferably still said RNA molecule is at least lOOOnb. 

In a further preferred embodiment of the invention said RNA molecule comprises 
sequences identified in Table 4 or Figures 4-54. 

10 

In yet a further preferred embodiment of the invention said RNA molecules comprise 
modified nucleotide bases. 

It will be apparent to one skilled in the art that the inclusion of modified bases, as 
15 well as the naturally occuring bases cytosine, uracil, adenosine and guanosine, may 
confer advantageous properties on RNA molecules containing said modified bases. 
For example, modified bases may increase the stability of the RNA molecule thereby 
reducing the amount required to produce a desired effect. The provision of modified 
bases may also provide stem-loop structures which are more or less stable. 

20 

According to a further aspect of the invention there is provided a nucleic acid 
molecule encoding at least part of a gene which mediates at least one step in stem cell 
differentiation comprising a first part linked to a second part which first and second 
parts are complementary over at least part of their length, wherein said nucleic .acid 
25 molecule is operably linked to at least one further nucleic acid molecule capable of 
promoting transcription of said nucleic acid linked thereto and further wherein said 
first and second parts form a double stranded region by complementary base pairing 
over at least part of their length as or when said nucleic acid molecule is transcribed. 

30 In a preferred embodiment of the invention said first and second parts are linked by 
linking nucleotides as hereinbefore described. 

13 
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It will be apparent to one skilled in the art that the synthesis of RNA molecules which 
form RNA stem loops can be achieved by providing vectors which include target 
genes, or fragments of target genes, operably linked to promoter sequences. 
5 Typically, promoter sequences are phage RNA polymerase promoters (eg T7, T3, 
SP6). Advantageously vectors are provided with multiple cloning sites into which 
genes or gene fragments can be subcloned. Typically, vectors are engineered so that 
phage promoters flank multiple cloning sites containing the gene of interest. 

10 Alternatively target genes or fragments of target genes can be fused directly to phage 
promoters by creating chimeric promoter/gene fusions via oligo synthesising 
technology. Constructs thus created can be easily amplified by polymerase chain 
reaction to provide templates for the manufacture of RNA molecules comprising 
stem loop RNA's. 

15 

According to a further aspect of the invention there is provided a vector including an 
expression cassette comprising a first sequence linked to a second sequence wherein 
said first and second sequences are complementary over at least part of their lengths 
and further wherein the expression cassette is transciptionally linked to a promoter 
20 sequence. 

In a preferred embodiment of the invention said first and second parts are linked by 
hnking nucleotides as hereinbefore described. 

25 Vectors including expression cassettes encoding stem-loop RNA's are adapted for 
eukaryotic gene expression. Typically said adaptation includes, by example and not 
by way of limitation, the provision of transcription control sequences (promoter 
sequences) which mediate cell/tissue specific expression. These promoter sequences 
may be cell/tissue specific, inducible or constitutive. 

30 
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Promoter elements typically also include so called TATA box and RNA polymerase 
initiation selection sequences which function to select a site of transcription 
initiation. These sequences also bind polypeptides which function, inter alia, to 
facilitate transcription initiation selection by RNA polymerase. 

5 

Adaptations also include the provision* of selectable markers 
and autonomous replication sequences which both facilitate the maintenance of said 
vector in either the eukaryotic cell or prokaryotic host. Vectors which are maintained 
autonomously are referred to as episomal vectors. Further adaptations which 
10 facilitate the expression of vector encoded genes include the provision of 
transcription termination sequences. 

These adaptations are well known in the art. There is a significant amount of 
published literature with respect to expression vector construction and recombinant 
15 DNA techniques in general. Please see, Sambrook et al (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbour Laboratory, Cold Spring Harbour, NY and 
references therein; Marston, F (1987) DNA Cloning Techniques: A Practical 
Approach Vol ET IRL Press, Oxford UK; DNA Cloning: F M Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc.(1994). 

20 

According to a further aspect of the invention there is provided a cell transfected with 
the nucleic acid or vector according to the invention. Preferably said cell is an 
embryonic stem cell or embryonic germ cell. Alternatively said cell is an embryonal 
carcinoma cell. 

25 

According to a further aspect of the invention there is provided a method to 
manufacture stem loop RNA molecules comprising: 

(i) providing a vector or promoter/gene fusion according to the invention; 

30 
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(ii) providing reagents and conditions which allow the synthesis of the RNA 
molecule comprising a stem loop RNA molecule according to the invention; and 

(iii) providing conditions which allow the RNA molecule to base pair over at least 
5 part of its length, or at least that part corresponding to the nucleic acid sequence 

encoding said stem cell gene which mediates stem cell differentiation. 

Preferably said gene, or gene fragment is selected from those genes represented in 
table 4 or Figures 4-54. 

10 

In vitro transcription of RNA is an established methodology. Kits are commercially 
available which provide vectors, ribonucleoside triphosphates, buffers, Rnase 
inhibitors, RNA polymerases (eg phage T7, T3, SP6) which facilitate the production 
of RNA. 

15 

According to a further aspect of the invention there is provided an in vivo method to 
promote the differentiation of stem cells comprising administering to an animal an 
effective amount of stem loop RNA molecule, or vector encoding a stem loop RNA 
molecule according to the invention, sufficient to effect differentiation of a target 
20 stem cell. 



Preferably said method promotes differentiation in vivo of endogenous stem cells to 
repair tissue damage in situ. 

25 It will be apparent to one skilled in the art that stem loop RNA relies on homology 
between the target gene RNA and double stranded region of the stem loop in a 
similar way to conventional RNAi. This confers a significant degree of specificity to 
the stem loop RNA molecule in targeting stem cells. For example, haemopoietic 
. stem cells are found in bone marrow and stem loop RNA molecules may be 

30 administered to an animal by direct injection into bone marrow tissue. 
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Stem loop RNA molecules may be encapsulated in liposomes to provide protection 
from an animals immune system and/or nucleases present in an animals serum. 

Liposomes are lipid based vesicles which encapsulate a selected therapeutic agent 
5 which is then introduced into a patient. Typically, the liposome is manufactured 
either from pure phospholipid or a mixture of phospholipid and phosphoglyceride. 
Typically liposomes can be manufactured with diameters of less than 200nm, this 
enables them to be intravenously injected and able to pass through the pulmonary 
capillary bed. Furthermore the biochemical nature of liposomes confers 
10 permeability across blood vessel membranes to gain access to selected tissues. 
Liposomes do have a relatively short half-life. So called STEALTH R liposomes have 
been developed which comprise liposomes coated in polyethylene glycol (PEG). The 
PEG treated liposomes have a significantly increased half-life when administered 
intravenously to a patient. In addition STEALTH R liposomes show reduced uptake 
15 in the reticuloendothelial system and enhanced accumulation selected tissues. In 
addition, so called immuno-liposomes have been develop which combine lipid based 
vesicles with an antibody or antibodies, to increase the specificity of the delivery of 
the RNAi molecule to a selected cell/tissue. 

The use of liposomes as delivery means is described in US55 80575 and US 5542935. 

It will be apparent to one skilled in the art that the stem loop RNA molecules can be 
provided in the form of an oral or nasal spray, an aerosol, suspension, emulsion, 
and/or eye drop fluid. Alternatively the stem loop RNA molecules may be provided 
in tablet form. Alternative delivery means include inhalers or nebulisers. 

According to a yet further aspect of the invention there is provided a therapeutic 
composition comprising a stem loop RNA molecule according to the invention or a 
vector encoding a stem loop RNA according to the invention. 

Preferably said stem loop RNA molecule or vector is for use in the manufacture of a 
medicament for use in promoting the differentiation of stem cells to provide 
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differentiated cells/tissues to treat diseases where cell/tissues are destroyed by said 
disease. 

Typically this includes pernicious anemia; stroke, neurodegenerative diseases such as 
5 Parkinson's disease, Alzhiemer's disease; coronary heart disease; cirrhosis; 
diabetes. It will also be apparent that differentiated stem cells may be used to replace 
nerves damaged as a consequence of (eg replacement of spinal cord tissue). 

In a further preferred embodiment of the invention said therapeutic composition 
10 further comprises a diluent, carrier or excipient. 

According to a further aspect of the invention there is provided a cell obtainable by 
the method according to the invention. 

15 It will be apparent that a cell obtainable by the method according to the invention has 
useful applications . For example, a stably transfected cell under the control of a 
regulatable promoter (ie inducible, repressible, developmentally regulated, cell 
lineage regulated, cell-cycle regulated) offers the opportunity to modulate the 
expression of the stem-loop RNA in said cell thereby modulating the differentiation 

20 state, or not as the case maybe, in culture or in viyo. 

According to a yet further aspect of the invention there is provided at least one organ 
comprising at least one cell obtainable by the method according to the invention. 

25 According to a yet further aspect of the invention there is provided a non-human ■ 
transgenic animal comprising a RNA molecule according to the invention, or a 
nucleic acid molecule according to the invention, or a vector according to the 
invention. 

30 An embodiment of the invention will now be described by example only and with 
reference to the following figures and tables wherein: 
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Table 1 represents a selection of antibodies used to monitor stem cell differentiation; 

Table 2 represents nucleic acid probes used to assess mRNA markers of stem 
5 differentiation; 

Table 3 represents protein markers of stem cell differentiation; 

10 Table 4 represents specific primers used to generate stem loop KNA for gene 
specific inhibition; 

Table 5 represents vectors used for the expression of stem loop RNA in cells 
including the promoters used to drive transcription of stem loop RNA's. 

15 

Figure 1 illustrates stem cell differentiation is controlled by positive and negative 
regulators (A). The specific cell phenotypes that are derived are a direct result of 
positive and negative regulators which activate or suppress particular differentiation 
events. Stem loop RNA can be used to control both the initial differentiation of stem 
20 cells (A) and the ultimate fate of the differentiated cells Dl and D2 by repression of 
positive activators which would normally promote a particular cell fate; 

Figure 2 represents the Oct 4 nucleic acid sequence from position 610-1032 of the 
sequence found in GenBank accession number NM_002701. 

25 

Fig 3A illustrates a transcription cassette comprising a promoter sequence operable 
linked to a nucleic acid encoding a stem loop RNA; Fig 3B illustrates a stem loop 
RNA synthesised from the cassette illustrated in Fig 1 A; 

30 Figure 4 is the nucleic acid sequence of murine notch ligand delta-like 1 ; 

Figure 5 is the nucleic acid sequence of murine notch ligand jagged 1; 
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Figure 6 is the nucleic acid sequence of human notch ligand jagged 1 (alagille 
syndrome) (JAG1); 

Figure 7 is the nucleic acid sequence of human notch ligand jagged 2 (JAG2) 

5 

Figure 8 is the nucleic acid sequence of murine notch ligand jagged 2; 
Figure 9 is the nucleic acid sequence of human notch ligand delta-like 3 (DLL3); 
10 Figure 10 is the nucleic acid sequence of human notch ligand delta-1 (DLL1); 

Figure 1 1 is the nucleic acid sequence of human notch ligand delta-like 4 (DLL4); 
Figure 12 is the nucleic acid sequence of murine notch ligand delta-like 4(DLL4); 

15 

Figure 13 represents the nucleic acid sequence of human Wnt 13; 
Figure 14 represents the nucleic acid sequence of human dickkopfl; 
20 Figure 15 represents the nucleic acid sequence of human dickkopf2\ 

Figure 16 represents the nucleic acid sequence of human dickkop/3; and 
Figure 17 represents the nucleic acid sequence of human dickkopf4\ 

25 

Figure 18 represents the nucleic acid sequence of WNT-1; 
Figure 19 represents the nucleic acid sequence of WNT-2; 
30 Figure 20 represents the nucleic acid sequence of WNT 2B; 

20 
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Figure 21 represents the nucleic acid sequence of WNT 3; 
Figure 22 represents the nucleic acid sequence of WNT 4; 
5 Figure 23 represents the nucleic acid sequence of WNT 5 A; 
Figure 24 represents the nucleic acid sequence of WNT 6; 
Figure 25 represents the nucleic acid sequence of WNT 7A; 

10 

Figure 26 represents the nucleic acid sequence of WNT 8B; 
Figure 27 represents the nucleic acid sequence of WNT 10B; 
15 Figure 28 represents the nucleic acid sequence of WNT 11; 
Figure 29 represents the .nucleic acid sequence of WNT 14 
Figure 30 represents the nucleic acid sequence of WNT 16; 

20 

Figure 31 represents the nucleic acid sequence of FZD 1; 
Figure 32 represents the nucleic acid sequence of FZD 2; 
25 Figure 33 represents the nucleic acid sequence of FZE 3; 
Figure 34 represents the nucleic acid sequence of FZD 4; 
Figure 35 represents the nucleic acid sequence of FZD 5; 

30 

Figure 36 represents the nucleic acid sequence of FZD 6; 
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Figure 37 represents the nucleic acid sequence of FZD 7; 
Figure 38 represents the nucleic acid sequence of FZD 8; 

5 

Figure 39 represents the nucleic acid sequence of FZD 9; 
Figure 40 represents the nucleic acid sequence of FZD 10; 
10 Figure 41 represents the nucleic acid sequence of FRP; 
Figure 42 represents the nucleic acid sequence of SARP 1; 
Figure 43 represents the nucleic acid sequence of SARP 2; 

15 

Figure 44 represents the nucleic acid sequence of FRZB; 
Figure 45 represents the nucleic acid sequence of FRPHE; 
20 Figure 46 represents the nucleic acid sequence of SARP 3; 
Figure 47 represents the nucleic acid sequence of CER 1; 
Figure 48 represents the nucleic acid sequence of DKK1; 

25 

Figure 49 represents the nucleic acid sequence of DKK 2; 
Figure 50 represents the nucleic acid sequence of DKK 3; 
30 Figure 5 1 represents the nucleic acid sequence of DKK 4; 
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Figure 52represents the nucleic acid sequence of WIF-1; 
Figure 53 represents the nucleic acid sequence of SRFP 1; 
5 Figure 54 represents the nucleic acid sequence of SRFP 4; 



10 



15 Materials and Methods 
Cell Culture 

NTERA2 and 2102Ep human EC cell lines were maintained at high cell density as 
previously described (Andrews et al 1982, 1984b), in DMEM (high glucose 
20 formulation) (DMEM)(GIBCO BRL), supplemented with 10% v/v bovine foetal calf 
serum (GB3CO BRL), under a humidified atmosphere with 10% C0 2 in air. 

Stem Loop RNA Production 

25 Primers were designed against specific target genes with T7 bacteriophage promoters 
at their 5 5 ends . The primers consist of typically 18- 25 bp against the target gene, a 
linker sequence of variable length (indicated by N in primer sequence) followed by 
the reverse complement of the gene specific sequence. The primers were used in a 
standard RNA in vitro, transcription reaction using a MEGASCRIPT kit following 

30 manufacturers protocols (Ambion, USA). Longer slRNA templates were produced 
buy cloning head-to -tail the sense and anti-sense gene specific sequences to generate 
a palindromic template from which RNA could be synthesized. 

The following primers were used 

35 
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Gene 


Accession 
Number 


Primer Sequence 


Oct4 


Z11899 


TAA TAC GAC TCA CTA TAG 
Ggagcagcttgggctcgagaag(N)cttctcgagcccaagctgctc 


HsNotch2 




TAA TAC GAC TCA CTA TAGGt cgt gca aga gcc 
agt tac cc(N)gg gta act ggc tct tgcacg a 


HsNotchl 


M73980 


TAA TAC GAC TCA CTA TAGGa atg gtc aat gcg 
agt ggc tgt cc(N)gg aca gcc act cgc gtt gac cat t | 


CIF 




TAA TAC GAC TCA CTA TAGGa gta gtg aga gtg 
aga gta aca(N)tgt tac tct cac tct cac tac t 


RBPJ-kappa 




TAA TAC GAC TCA CTA TAGGt cctgtg cctgtg gta 
gag a(N)t etc tac cac agg cac agg a 


Dlkl 


NM_002226 


TAA TAC GAC TCA CTA TAGGcctc ttg etc ctg ctg 
get tt(N)aaagccagcaggagcaagagg 



Capital letters indicate the T7 polymerase promoter sequence. 

5 In each case, a quantity of the PCR was electrophoresed through agarose to verify 
product size and abundance, whilst the remainder was purified by alkaline 
phenol/chloroform extraction. RNA was synthesized using the Megascript kit 
(Ambion Inc.) according to the manufacturer's protocol and acid phenol/chloroform 
extracted. The simultaneous synthesis of complementary strands of RNA in a single 
10 reaction circumvents the requirement for an annealing step. However, the quality and 
duplexing of the synthesized RNA was confirmed by agarose gel electrophoresis, 
with the desired products migrating as expected for double stranded DNA of the 
same length. 

15 Stem Loop RNA introduction to Ceil Lines 

Human EC stem cells were seeded at 2 XI 0 5 cells/well of a 6 well plate in 3 cm 3 of 
Dulbecco's modified Eagles medium and allowed to settle for 3 hrs. 
Appx. 9.5|ig of DNA was incubated with an optimised amount of ExGEN 500 for 
20 each well of a 6-well plate. Previously cells were seeded 1 day before. This gives 
apprx. a 70% confluent culture. The DNA/ExGen mixture was added to the cells and 
the culture vessel spun at 280g for 5 mins. 

Total RNA production 
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Growing cultures of cells were aspirated to remove the DME and foetal calf serum. 
Trace amounts of foetal calf serum was removed by washing in Phosphate-buffered 
saline. Fresh PBS was added to the cells and the cells were dislodged from the 
5 culture vessel using acid washed glass beads. The resulting cell suspension was 
centrifuged at 300xg. The pellets had the PBS aspirated from them. Tri reagent 
(Sigma, USA) was added at 1ml per 10 7 cells and allowed to stand for 10 mins at 
room temperature. The lysate from this reaction was centrifuged at 12000 x g for 15 
minutes at 4°C. The resulting aqueous phase was transferred to a fresh vessel and 
10 0.5 ml of isopropanol / ml of trizol was added to precipitate the RNA. The RNA was 
pelleted by centrifugation at 12000 x g for 10 mins at 4°C. The supernatant was 
removed and the pellet washed in 70% ethanol. The washed RNA was dissolved in 
DEPC treated double-distilled water. 

15 Analysis of the differentiation of EC stem cells induced by exposure to Stem Loop 
RNA 

Following exposure to stem loop RNA corresponding to specific key regulatory 
genes, the subsequent differentiation of the EC cells was monitored in a variety of 
20 ways. One approach was to monitor the disapearance of typical markers of the stem 
cell phenotype; the other was to monitor the appearance of markers pertinent to the 
specific lineages induced. The relevant markers included surface antigens, mRNA 
species and specific proteins. 

25 Analysis of Transfectants by Antibody Staining and FACS 

Cells were treated with trypsin (0.25% v/v) for 5 mins to disaggregate the cells; they 
were washed and re-suspended to 2x1 0 5 cells/ml. This cell suspension was incubated 
with 50fxl of primary antibody in a 96 well plate on a rotary shaker for 1 hour at 4°C. 
30 Supernatant from a myeloma cell line P3X63Ag8, was used as a negative control. 
The 96 well plate was centrifuged at lOOrpm for 3 minutes. The plate was washed 3 
times with PBS containing 5% foetal calf serum to remove unbound antibody. Cell 
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were then incubated with 50 \i\ of an appropriate FITC-conjugated secondary 
antibody at 4°C for 1 hour. Cells were washed 3 times in PBS + 5% foetal calf 
serum and analysed using an EPICS elite ESP flow cytometer (Coulter eletronics, 
U.K).(Andrews et al., 1982) 

5 

Northern blot Analysis of RNA 

RNA separation relies on the generally the same principles as standard DNA but with 
some concessions to the tendancy of RNA to hybridise with itself or other RNA 
molecules. Formaldehyde is used in the gel matrix to react with the amine groups of 

10 the RNA and form Schiff bases. Purified RNA is run out using standard agarose gel 
electrophoresis. For most RNA a 1% agarose gel is sufficiant. The agarose is made in 
IX MOPS buffer and supplemeted with 0.66M formaldehyde.Dryed down RNA 
samples are reconstituted and denatured in RNA loading buffer and loaded into the 
gel. Gels are run out for apprx. 3 hrs (until the dye front is 3/4 of the way down the 

15 gel). 

The major problem with obtaining clean blotting using RNA is the presence of 
formaldehyde. The run out gel was soaked in distilled water for 20 mins with 4 
changes, to remove the formaldehyde from the matrix. The transfer assembly was 

20 assembled in exactly the same fashion as for DNA (Southern) blotting.The transfer 
buffer used however was 10X SSPE. Gels were transfered overnight. The membrane 
was soaked in 2X SSPE to remove any agarose from the transfer assembly and the 
RNA was fixed to the memebrane. Fixation was acheived using short-wave (254 nM) 
UV light. The fixed membrane was baked for 1-2 hrs to drive off any residual 

25 formaldehyde. 

Hybridisation was acheived in aqueous phase with formamide to lower the 
hybridisation temperatures for a given probe. RNA blots were prehybridised for 2-4 
hrs in northern prehybridisation soloution. Labelled DNA probes were denatured at 
30 95°C for 5 mins and added to the blots. All hybridisation steps were carried out in 
rolling bottles in incubation ovens. Probes were hybridised overnight for at least 16 
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hrs in the prehybridisation soloution. A standard set of wash, soloutions were used. 
Stringency of washing was acheived by the use of lower salt containing wash buffers. 
The following wash procedure is outlined as follows 



2XSSPE 15 mins 

2XSSPE 15mins 

2X SSPE/ 0.1% SDS 45 mins 

2X 8SPE/0.1%SDS 45 mins 

0.1XSSPE 15 mins 



room temp 
room temp 
65°C 
65°C 

room temp 



10 Preparation of radiolabelled DNA probes 

The method of Feinberg and Vogelstein (Feinberg and Vogelstein, 1983) was used to 
radioactively label DNA. Briefly, the protocol uses random sequence hexanucleotides 
to prime DNA synthesis at numerous sites on a denatured DNA template using the 

15 Klenow DNA polymerase I fragment. Pre-formed kits were used to aid consistency . 
5-100ng DNA fragment (obtained from gel purifcation of PCR or restriction digests) 
was made up in water,denatured for 5 mins at 95°C with the random hexamers. The 
mixture was quench cooled on ice and the following were added, 
5 jal [a-32P] dATP 3000 Ci/mmol 

20 1 jil of Klenow DNA polymerase (4U) 

The reaction was then incubated at 37°C for 1 hr. Unincorporated nucleotide were 
removed with spin columns ( Nucleon Biosciences). 



Production of cDNA 

25 

The enzymatic conversion of RNA into single stranded cDNA was achieyed using 
the 3' to 5' polymerase activity of recombinant Moloney-Murine Leukemia Virus 
(M-MLV) reverse transcriptase primed with oligo (dT) and (dN) primers. For 
Reverse Transcription-Polymerase Chain Reaction, single stranded cDNA was used. 
30 cDNA was synthesised from l^g poly (A)+ RNA or total RNA was incubated with 
the following 

1 .0|jM oligo(dT) primer for total RNA or random hexcamers for mRNA 
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0.5mM 1 OmM dNTP mix 

lU/(al RNAse inhibitor (Promega) 

l.OU/fil M-MLV reverse transcriptase in manufacturers supplied buffer 
(Promega) 

5 The reaction was incubated for 2-3 hours at 42°C 
Fluorescent Automated Sequencing 

To check the specificity of the PCR primers used to generate the template used in 
stem loop RNA production automatic sequencing was carried out using the prism 

10 fluorescently labelled chain terminator sequencing kit (Perkin-Elmer) (Prober et al 
1987). A suitable amount of template (200ng plasmid, lOOng PCR product), 10 pM 
sequencing primer (typically a 20mer with 50% G-C content) were added to 8 ^1 of 
prism pre-mix and the total reaction volume made up to 20 faL 24 cycles of PCR 
(94°C for 10 seconds, 50°C for 10 seconds, 60°C for 4 minutes). Following thermal 

15 cycling, products were precipitated by the addition of 2fil of 3M sodium acetate and 
50 jaI of 100 % ethanol. DNA was pelleted in an Eppendorf microcentrifuge at 13000 
rpm, washed once in 70% ethanol and vacuum dried. Samples were analysed by the 
in-house sequencing Service (Krebs Institute). Dried down samples were 
resuspended in 4 jxl of formamide loading buffer, denatured and loaded onto a ABI 

20 373 automatic sequencer. Raw sequence was collected and analysed using the ABI 
prism software and the results were supplied in the form of analysed histogram 
traces. c 

Detection of specific protein targets by SDS-PAGE and Western Blotting 

25 

To obtain cell lysates monolayers of cells were rinsed 3 times with ice-cold PBS 
supplemented with 2 mM CaCk- Cells were incubated with 1 ml/75 cm 2 flask lysis 
buffer (1% v/v NP40, 1% v/v DOC, 0.1 mM PMSF in PBS) for 15 min at 4° C. Cell 
lysates were transferred to eppendorf tubes and passed through a 21 gauge needle to 
30 shear the DNA. This was followed by freeze thawing and subsequent centrifugation 
(30 min, 4°C, 15000g) to remove insoluble material. Protein concentrations of the 
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supernatants were determined using a commercial protein assay (Biorad). Samples 
were prepared for SDS-PAGE by adding 6 times Laemmli electrophoresis sample 
buffer and boiling for 5 min. After electrophoresis with 16 jxg of protein on a 10% 
polyacrylamide gel (Laemmli, 1970) the proteins were transferred to PVDF 
5 membrane. The blots were washed with PBS and 0.05% Tween (PBS-T). Blocking 
of the blots occurred in 5% milk powder in PBS-T (60 min, at RT). Blots were 
incubated with the appropriate primary antibody. Horseradish peroxidase labelled 
secondary antibody was used to visualise antibody binding by ECL (Amersham, 
Bucks., UK). Materials used for SDS-PAGE and western blotting were obtained from 
10 Biorad (California, USA) unless stated otherwise. 



Table 1: Antibodies used to detect stem cell differentiation 



Antibody 


Class 


Species 


Cell 

phenotype 
detected 


Changes on 

Differentiatio 

n 


Reference 


TRA-1- 
60 


IgM 


Mouse 


Human EC, 
ES cells. 


I 

differentiation 


Andrews et.al., 
1984a 


TRA-1- 
81 


IgM 


Mouse 


Human EC, 
ES cells. 


i 

differentiation 


Andrews et. 
al.,1984a 


SSEA3 


IgM 


Rat 


Human EC, 
ES cells. 


I 

differentiation 


Shevinsky et al 
1982, Fenderson 
et al 1987 


SSEA4 


IgG 


Mouse 


Human EC, 
ES cells. ■ 


I 

differentiation 


Kannagi et al 
1983 Fenderson 
et al 1987 


A2B5 


IgM 


Mouse 




t 

differentiation 


Fenderson et al 
1987 


ME311 


IgG 


Mouse 




t 

differentiation 


Fenderson et al 
1987 


VIN-IS- 
56 


IgM 


Mouse 




t 

differentiation 


Andrews et al 
1990 


. VIN-IS- 
53 


IgG 


Mouse 




f 

differentiation 


Andrews et al 
1990 















15 

Table 2: Probes used to assess mRNA markers of differentiation 
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Gene 


Cell Type 


Synaptophysin 


Neuron 


NeuroDl 


Neuron 


MyoDl 


Muscle 


Collagens 


Cartlidge 


Alpha-actin 


Skeletal muscle 


Smooth-muscle actin 


Smooth muscle 



5 



10 



Table 3: Protein markers of differentiation, detected by Western Blot and/or 
immunofluorescence. 

15 The following antibodies were detected by the appropriate commercially 
available antibodies 



Cell Type 


Antigen 


Neurons 


Neurofilaments 


Glial cells 


GFAP 


Epithelial cells 


Cytokeratins 


Mesenchymal cells 


Vimentin 


Muscle 


Desmin 


Muscle 


Tissue specific actins 


Connective tissue cells 


Collagens 
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Table 4: Specific Primers used to generate Stem Loop RNA for gene specific 
inhibition 



5 All sequences written 5' to 3' 





Gene Name 


Accession 
number 


PCR primer Sequences 


Position 


Notch Pathway 


Ligands: 










Dll-1 


AF003522 








D113 


NM_016941 








D114 


NM_019454 








Dlk-1 


NM_003836 








Jaggedl 


U73936 








Jagged2 


NM_002226 






Receptors: 










Notchl 


M73980 


gcggccgcctttgtggttctgttc 
gccggcgcgtcctcctcttcc 


5224-5726 




Notch2 


In-house 
sequence 


gccagaatgatgctacctgt 
tagagcagcaccaatggaac 






Notch3 


U97669 


Aagttacccccaagaggcaagtgtt 
Aaggaaatgagaggccagaagga 
ga 


7013-7348 




Notch4 


U95299 


ggctgcccctcccactctcg 
cagcccgggccccaggatag 


3727-4132 


Downstream: 










TLE-1 


NM_005077 








TLE-2 


M99436 








TLE-3 


M99438 








TLE-4 


M99439 
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TCF7 


NM_003202 








TCFFL2 


Y11306 








TCF3 


M31523 








TCF19 


NM_007109 








TCF1 


NM_000545 








infringe 


NM_002405 








lfringe 


U94354 








rFringe 


AF108139 








Sell 


AF157516 








Numb 


NM_003744 








LNX 


NM_010727 






Wingless Pathway 








Ligands 










Wntl 


NM 005430 








Wnt2 


NM_003391 








Wnt2B 


NM_004185 


tgagtggttcctgtactctg 
actcacactgggtaacacgg 


1159-1503 




Wnt5A 


L20861 








Wnt6 


AF079522 








Wnt7A 


NM_004625 








Wnt8B 


NM_003393 








WntlOB 


NM_003394 








Wntll 


NM_004626 








Wntl4 


AF028702 








Wntl5 


AF028703 








Wntl 6 


AF 169963 






Receptors 










FZD1 


NM_003505 








FZD2 


NM_001466 


tacccagagcggcctatcattttt 


955-1439 
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acgaagccggccaggaggaagga 
c 






FZD3 


NM_017412 








FZD4 


NM_012193 








FZD5 


NM_003468 








FZD6 


NM_003506 


Tggcctgaggagcttgaatgtgac 
Atcgcccagcaaaaatccaatgaa 


607-1026 




FZD7 


NM_003507 








FZD8 


AA481448 








FZD9 


NM_003508 








FZD10 


NM_007197 








FRZB 


NM_001463 






Extracellular 
Effectors 












SFRP1 


NM_003012 








SFRP2 


AF017986 








SFRP4 


AF026692 


agaggagtggctgcaatgaggtc 
gcgcccggctgttttctt 


877-1178 




SFRP5 


NM.003015 








SK 


AB020315 








CER1 


NM_005454 








WIF-1 


NM_007191 








DVL1 


U46461 








DVL2 


NM 004422 








DVL3 


\ti it r\r\ a /loo 

NM_004423 






' 11 Von cnfinfinn 1 

jl rails crip uoii j 












Oct4 


Z11899 








Brachyury 


NMJ)03181 
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NeuroDl 


NM_002500 








NeuroD2 


NM_006160 








NeuroD3 


U63842 








MyoD 


NM_002478 








MDFI 


NM_005586 








REST 


NM_005612 



























Table 5 

5 Listed are examples of vector systems that are to be used in cells to direct the 
production of stem loop RNA. 



Expression System 


Vectors 


Accession numbers 


Promoters 


Tet-on/Tet-off 

Clontech, USA 


pTet-on 

pTet-off 

pTRE2-Hyg 


U89930 
U89929 


CMV 

MyoDl 

NeuroDl 

Oct4 

GATAl 

Beta-actin 

PGK 


IRES 

Invitrogen, 
Nethelands) 


pIRES-EGFP 




CMV 

MyoDl 

NeuroDl 

Oct4 

GATAl 

Beta-actin 

PGK 
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CLAIMS 

1 . A method to modulate the differentiation state of a stem cell comprising: 
i) contacting a stem cell with at least one nucleic acid molecule comprising a 
sequence of a gene which mediates at least one step in the differentiation of said cell 
which nucleic acid molecule consists of a first part linked to a second part wherein 
said first and second parts are complementary over at least part of their length and 
further wherein said first and second parts form a double stranded region by 
complementary base pairing over at least part of their length; 

(ii) providing conditions conducive to the growth and differentiation of the cell 
treated in (i) above; and optionally 

(iii) maintaining and/or storing the cell in a differentiated state. 

2. A method according to Claim 1 wherein said first and second parts are linked 
by at least one nucleotide base. 

3 A method according to Claim 1 or 2 wherein said nucleic acid molecule is a 
stem loop RNA molecule or a nucleic acid molecule or a vector encoding said stem 
loop RNA. 

4. A method according to any of Claims 1-3 wherein said conditions are in vitro 
cell culture conditions. 

5. A method according to any of Claims 1-4 wherein said stem cell is selected 
from the group consisting of: an embryonic stem cell; an embryonic germ cell; an 
embryonal carcinoma cell; a haemopoietic stem cell; a muscle stem cell; a nerve 
stem cell; a skin dermal sheath stem cell; a liver stem cell; a teratocarcinoma cell. 

6. A method according to any of Claims 1-5 wherein said stem cell is an 
embryonic stem cell or embryonic germ cell. 

39 

SUBSTITUTE SHEET (RULE 26) 



WO 03/012082 



PCT/GB02/03409 



7. A method according to any of Claims 1-6 wherein said nucleic acid molecule 
is derived from at least one nucleic acid sequence as represented by Figures 4- 54. 

8. A RNA molecule derived from a coding sequence of at least one gene 
5 involved in stem cell differentiation comprising a first part linked to a second part 

wherein said first and second parts are complementary over at least part of their 
length and further wherein said first and second parts form a double stranded region 
by complementary base pairing over at least part of their length. 

10 9. A RNA molecule according to Claim 8 wherein said first and second parts 
are linked by at least one nucleotide base (nb). 

10. A RNA molecule according to Claim 9 wherein said first and second parts 
are linked by 2, 3, 4, 5, 6, 7, 8, 9, or lOnb in length. 

15 

11. A RNA molecule according to Claim 9 wherein said linker is at least lOnb in 
length. 

12. A RNA molecule according to any of Claims 8-11 wherein the length of the 
20 RNA molecule is between lOnb -lOOOnb in length. 

13. A RNA molecule according to Claim 1 2 wherein the length of the RNA 
molecule is selected from lOnb; 20nb; 30nb; 40nb; 50nb; 60nb; 70nb; 80nb; 90nb in 
length. 

25 

14. A RNA molecule according to Claim 12 wherein said RNA molecule is 
lOOnb; 200nb; 300nb; 400nb; 500nb; 600nb; 700nb; 800nb; 900nb; or lOOOnb in 
length. 

30 15. A RNA molecule according to Claim 8 wherein said RNA molecule is at 
least lOOOnb in length. 
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16. A RNA molecule according to Claim 8 wherein said RNA molecule is 

21nb in length. 

5 17. A RNA molecule according to any of Claims 8-16 wherein said RNA 

molecule comprises sequences identified in Figures 4-54. 

18. A RNA molecule according to any of Claims 8-17 wherein said RNA 
molecules comprise modified nucleotide bases. 

10 

19. A nucleic acid molecule which encodes an RNA molecule according to any of 
Claims 8-18 wherein said nucleic acid molecule is operably linked to at least one 
further nucleic acid molecule capable of promoting transcription of said nucleic acid 
linked thereto. 

15 

20. A nucleic acid molecule according to Claim 19 wherein said further nucleic 
acid molecule is a promoter capable of inducible transcription. 

21 . A vector including a nucleic acid molecule according to Claim 19 or 20. 

20 

22. A cell transfected with an RNA molecule according to any of Claims 8-18, 
nucleic acid molecule according to Claim 19 or 20 or a vector according to Claim 
21. 

25 23. A cell according to Claim 22 wherein said cell is an embryonic stem cell or 
embryonic germ cell. 

24. A cell according to Claim 22 wherein said cell is an embryonal carcinoma 
cell. 

30 

25. A method to manufacture stem loop RNA molecules comprising: 
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(i) providing a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21; 

(ii) providing reagents and conditions which allow the synthesis of the RNA 
molecule comprising a RNA molecule according to any of Claims 8-18; and 

(iii) providing conditions which allow the RNA molecule to base pair over at least 
part of its length, or at least that part corresponding to the nucleic acid sequence 
encoding said stem cell gene which mediates stem cell differentiation. 

26. An in vivo method to promote the differentiation of stem cells comprising 
administering to an animal an effective amount of an RNA molecule according to any 
of Claims '8-18, a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21, sufficient to effect differentiation of a target stem cell. 

27. A RNA molecule according to any of Claims 8-18, a nucleic acid molecule 
according to Claim 19 or 20 or a vector according to Claim 21 for use as a 
pharmaceutical. 

28. A pharmaceutical composition comprising a RNA molecule according to any 
of Claims 8-18, a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21. 

29. Use of a RNA molecule according to any of Claims 8-18, a nucleic acid 
molecule according to Claim 19 or 20 or a vector according to Claim 21 for the 
manufacture of a medicament for use in promoting the differentiation of stem cells to 
provide differentiated cells/tissues to treat diseases where cell/tissues are destroyed 
by said disease. 
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30 Use according to Claim 29 wherein said disease is selected from the group 
consisting of: pernicious anemia; stroke, neurodegenerative diseases such as 
Parkinson's disease, Alzhiemer's disease; coronary heart disease; cirrhosis; 
diabetes; nerves damaged as a consequence of trauma (e.g. replacement of spinal 
5 cord tissue). 

31. A cell obtainable by the method according to any of Claims 1 -7. 

32. An organ comprising at least one cell according to Claim 3 1 . 

10 

33 . A non-human transgenic animal comprising a RNA molecule according to 
any of Claims 8-18, or a nucleic acid molecule according to Claim 19 or 20, or a 
vector according to Claim 21. 
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AGCAGCTTGGGCTCGAGAAGGATGTGGTCCGAGTGTGGTTCTGTAACCGGCGCCAG 

AAGGGCAAGCGATCAAGCAGCGACTATGCACAACGAGAGGATTTTGAGGCTGCTGG 

GTCTCCTTTCTCAGGGGGACCAGTGTCCTTTCCTCTGGCCCCAGGGCCCCATTTTGGT 

GCCCCAGGCTATGGGAGCCCTCACTTCACTGCACTGTACTCCTCGGTCCCTTTCCCTG 

AGGGGGAAGCCTTTCCCCCTGTCTCTGTCACCACTCTGGGCTCTCCCTTGCATTCAAA 

CTGAGGTGCCTGCCTGCCCTTCTAGGAATGGGGGACAGGGGGAGGGGAGGAGCTAG 

GGAAAGAAAACCTGGAGTTTGTGCCAGGGTTTTTGGATTAAGTTCTTCATTCACTAA 

GGAAGGAATTGGGAACACAAAGGG 
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Figure 3 

GTCCAGCGGTACCATGGGCCGTCGGAGCGCGCTAGCCCTTGCCGTGGTCTCTGCCCTGCTGTGC 

CAGGTCTGGAGCrCCGGCGTATTrGAGCTGAAGCTGCAGGAGTTCGTCAACAAGAAGGGGCTG 

CTGGGGAACCGCAACTGCTGCCGCGGGGGCTCTGGCCCGCCTrGCGCCTGCAGGACCnrrCTTTC 

GCGTATGCCTCAAGCACTACCAGGCCAGCGTGTCACCGGAGCCACCCTGCACCTACGGCAGTG 

CTGTCACGCCAGTGCTGGGTGTCGACTCCTTCAGCCTGCCTGATGGCGCAGGCATCGACCCCGC 

CTTCAGCAACCCCATCCGATTCCCCnTCGGCTTCACCTGGCCAGGTACCTTCTCTCT 

AAGCCCTCCATACAGACTCTCCCGATGACCTCGCAACAGAAAACCCAGAAAGACTCATCAGCC 

GCCTGACCACACAGAGGCACCTCACTGTGGGAGAAGAATGGTCTCAGGACCTTCACAGTAGCG 

GCCGCACAGACCTCCGGTACTCTTACCGGTTTGTGTGTGACGAGCACTACTACGGAGAAGGTTG 

CTCTGTGTTCTGCCGACCrCGGGATGACGCCTTTGGCCACTTCACCTGCGGGGACAGAGGGGAG 

AAGATGTGCGACCCTGGCTGGAAAGGCCAGTACTGCACTGACCCAATCTGTCTGCCAGGGTGT 

GATGACCAACATGGATACTGTGACAAACCAGGGGAGTGCAAGTGCAGAGTTGGCTGGCAGGGC 

CGCTACTGCGATGAGTGCATCCGATACCCAGGTTGTCTCCATGGCACCTGCCAGCAACCCTGGC 

AGTGTAACTGCCAGGAAGGCTGGGGGGGCCTTTTCTGCAACCAAGACCTGAACTACTGTACTCA 

CCATAAGCCGTGCAGGAATGGAGCCACCTGCACCAACACGGGCCAGGGGAGCTACACATGTTC 

CTGCCGACCTGGGTATACAGGTGCCAACTGTGAGCTGGAAGTAGATGAGTGTGCTCCTAGCCCC 

TGCAAGAACGGAGCGAGCTGCACGGACCTTGAGGACAGCTTCTCTTGCACCTGCCCTCCCGGCT 

TCTATGGCAAGGTCTGTGAGCTGAGCGCCATGACCTGTGCAGATGGCCCTrGCTTCAATGGAGG 

ACGATGTTCAGATAACCCTGACGGAGGCTACACCTGCCATTGCCCCTTGGGCTTCTCTGGCTTC 

AACTGTGAGAAGAAGATGGATCTCTGCGGCTCTTCCCCTTGTTCTAACGGTGCCAAGTGTGTGG 

ACCTCGGCAACTCTTACCTGTGCCGGTGCCAGGCTGGCTTCTCCGGGAGGTACTGCGAGGACAA 

TGTGGATGACTGTGCCTCCTCCCCGTGTGCAAATGGGGGCACCTGCCGGGACAGTGTGAACGAC 

TTCTCCTGTACCTGCCCACCTGGCTACACGGGCAAGAACTGCAGCGCCCCTGTCAGCAGGTGTG 

AGCATGCACCCTGCCATAATGGGGCCACCTGCCACCAGAGGGGCCAGCGCTACATGTGTGAGT 

GCGCCCAGGGCTATGGCGGCCCCAACTGCCAGTTTCTGCTCCCTGAGCCACCACCAGGGCCCAT 

GGTGGTGGACCTCAGTGAGAGGCATATGGAGAGCCAGGGCGGGCCCTTCCCCTGGGTGGCCGT 

GTGTGCCGGGGTGGTGCTTGTCCTCCTGCTGCTGCTGGGCTGTGCTGCTGTGGTGGTCTGCGTCC 

GGCTGAAGCTACAGAAACACCAGCCTCCACCTGAACCCTGTGGGGGAGAGACAGAAACCATGA 

ACAACCTAGCCAATTGCCAGCGCGAGAAGGACGTTTCTGTTAGCATCATTGGGGCTACCCAGAT 

CAAGAACACCAACAAGAAGGCGGACTTTCACGGGGACCATGGAGCCAAGAAGAGCAGCTTTA 

AGGTCCGATACCCCACTGTGGACTATAACCTCGTTCGAGACCTCAAGGGAGATGAAGCCACGG 

TCAGGGATACACACAGCAAACGTGACACCAAGTGCCAGTCACAGAGCTCTGCAGGAGAAGAG 

AAGATCGCCCCAACACTTAGGGGTGGGGAGATTCCTGACAGAAAAAGGCCAGAGTCTGTCTAC 

TCTACTTCAAAGGACACCAAGTACCAGTCGGTGTATGTTCTGTCTGCAGAAAAGGATGAGTGTG 

TTATAGCGACTGAGGTGTAAGATGGAAGCGATGTGGCAAAATTCCCATTTCTCTCAAATAAAAT 

TCCAAGGATATAGCCCCGATGAATGCTGCTGAGAGAGGAAGGGAGAGGAAACCCAGGGACTG 

CTGCTGAGAACCAGGTTCAGGCGAAGCTGGTTCTCTCAGAGTTAGCAGAGGCGCCCGACACTG 

CCAGCCTAGGCTTTGGCTGCCGCTGGACTGCCTGCTGGTTGTTCCCATTGCACTATGGACAGTTG 

CTTTGAAGAGTATATATTTAAATGGACGAGTGACTTGATTCATATAGGAAGCACGCACTGCCCA 

CACGTCTATCTTGGATTACTATGAGCCAGTCTTTCCT^ 

TCCTTTTTGATACTGAGATGTGTTTTTTTTI 11 CCTAGACGGGAAAAAGAAAACGTGTGTTATTT 

TTiTGGGATTTGTAAAAATAlllirCATGATATCTGTAAAGCTTGAGTATTTTGTGACGTTCATT 

TITITATAATTTAAATTTTGGTAAATATGTACAAAGGCACTTCGGGTCTATG^ 

TTGTATATAAATGTATTTATGGAATATTGTGCAAATGTTATTTGAGTTTTTTACT 

GAAGAAATTCATTTTAAAAATATTTTTCCAAAAtAAATATAATGAACTACA 



Figure 4 

CGGGCAGAGGTGGAAGAGGGGGGAGCGCCTCAAAGAAGCGATCAGAATAATAAAAGGAGGCC 
GGGCTCTTTGCCITCTGGAACGCGCGGCTCTTGAAAG 

GTCGTGCATGCTCCAATCCACGGAGTATATTAGAGCCGGGACGCGGCGGCCGCGGGGGCAGCG 
ACGACGGCAGCCTCGGCGGGAGCACCAGCGCTAGCAGCGGCGGCGGCGTCCGGAGTGCCCGTG 
GCGCGCGGCGCAGCGATGCGGTCCCCACGGACGCGCGGCCGGCCCGGGCGCCCCCTGAGTCTT 



WO 03/012082 PCT/GB02/03409 

5/41 

CTGCTCGCCCTGCTCrGTGCCCTGCGAGCCAAGGTGTGCGGGGCCTCGGGTCAGTTTGAGCTGG 

AGATCCTGTCCATGCAGAACGTGAATGGAGAGCTACAGAATGGGAACTGTTGTGGTGGAGTCC 

GGAACCCTGGCGACCGCAAGTGCACCCGCGACGAGTGTGATACGTACTTCAAAGTGTGCCTCA 

AGGAGTATCAGTCCCGCGTCACTGCCGGGGGACCCTGCAGCTTCGGCTCAGGGTCTACGCCTGT 

CATCGGGGGTAACACCTTCAATCTCAAGGCCAGCCGTGGCAACGACCGTAATCGCATCGTACTG 

CCTTTCAGTTTCGCCTGGCCGAGGTCCTAGACTTrGCTGGTGGAGGCCTGGGATTCCAGTAATG 

ACACTATTCAACCTGATAGCATAATTGAAAAGGCTTCTCACTCAGGCATGATAAACCCTAGCCG 

GCAATGGCAGACACTGAAACAAAACACAGGGATTGCCCACTTCGAGTATCAGATCCGAGTGAC 

CTGTGATGACCACTACTATGGCTTTGGCTGCAATAAGTTCTGTCGTCCCAGAGATGACTrCTTTG 

GACATTATGCCTGTGACCAGAACGGCAACAAAACTTGCATGGAAGGCTGGATGGGTCCTGATT 

GCAACAAAGCTATCTGCCGACAGGGCTGCAGTCCCAAGCATGGGTCTTGTAAACTTCCAGGTG 

ACTGCAGGTGCCAGTACGGTTGGCAGGGCCTGTACTGCGACAAGTGCATCCCGCACCCAGGAT 

GTGTCCACGGCACCTGCAATGAACCCTGGCAGTGCCTCTGTGAGACCAACTGGGGTGGACAGC 

TCTGTGACAAAGATCTGAATTACTGTGGGACTCATCAGCCCTGTCTCAACCGGGGAACATGTAG 

CAACACTGGGCCTGACAAATACCAGTGCTCCTGCCCAGAGGGCTACTCGGGCCCCAACTGTGA 

AATTGCTGAGCATGCTTGTCTCTCTGACCCCTGCCATAACCGAGGCAGCTGCAAGGAGACCTCC 

TCAGGCTTTGAGTGTGAGTGTTCTCCAGGCTGGACTGGCCCCACGTGTTCCACAAACATCGATG 

ACTGTTCrCCAAATAACTGTTCCCATGGGGGCACCTGCCAGGATCTGGTGAATGGATTCAAGTG 

TGTGTGCCCGCCCCAGTGGACTGGCAAGACTTGTCAGTTAGATGCAAATGAGTGCGAGGCCAA 

ACCTTGTGTAAATGCCAGATCCTGTAAGAATCTGATTGCCAGCTACTACTGTGATTGCCTTCCTG 

GCTGGATGGGTCAGAACTGTGACATAAATATCAATGACTGCCTTGGCCAGTGTCAGAATGACG 

CCTCCTGTCGGGATTTGGTTAATGGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCAC 

TGTGAGAGAGACATCGATGAGTGTGCTAGCAACCCCTGCTTGAATGGGGGTCACTGTCAGAAT 

GAAATCAACAGATTCCAGTGTCTCTGTCCCACTGGTTTCTCTGGAAACCTCTGTCAGCTGGACA 

TCGATTACTGCGAGCCCAACCCTTGCCAGAATGGCGCCCAGTGCTACAATCGTGCCAGTGACTA 

TTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACCTGAAAGACCACTGCCG 

TACCACCACCTGCGAAGTGATTGACAGCTGCACTGTGGCCATGGCCTCCAACGACACGCCTGAA 

GGGGTGCGGTATATCTCTTCTAACGTCTGTGGTCCCCATGGGAAGTGCAAGAGCCAGTCGGGAG 

GCAAArrCACCTGTGACTGTAACAAAGGOTCACCGGCACCTACTGCCATGAAAATATCAACGA 

CTGCGAGAGCAACCCCTGTAAAAACGGTGGCACCTGCATCGATGGCGTTAACTCCTACAAGTGT 

ATCTGTAGTGACGGCTGGGAGGGAGCGCACTGTGAGAACAACATAAATGACTGTAGCCAGAAC 

CCTTGTCACTACGGGGGTACATGTCGAGACCTGGTCAATGACTTTTACTGTGACTGCAAAAATG 

GCTGGAAAGGAAAGACTTGCCATTCCCGTGACAGCCAGTGTGACGAAGCCACGTGTAATAATG 

GTGGTACCTGCTATGATGAAGTGGACACGTTTAAGTGCATGTGTCCCGGTGGCTGGGAAGGAA 

CAACTTGTAATATAGCTAGAAACAGTAGCTGCCTGCCGAACCCCTGTCATAATGGAGGTACCTG 

CGTGGTCAATGGAGACTCCTTCACCTGTGTCTGCAAAGAAGGCTGGGAGGGGCCTATTTGTACT 

CAAAATACCAACGACTGCAGTCCCCATCCTTGTTACAATAGCGGGACCTGTGTGGACGGAGAC 

AACrGGTATCGGTGCGAATGTGCCCCGGGTTTTGCTGGGCCAGACTGCAGGATAAACATCAATG 

AGTGCCAGTCTTCCCCTTGTGCCTTTGGGGCCACCTGTGTGGATGAGATCAATGGCTACCAGTG 

TATCTGCCCTCCAGGACATAGTGGTGCCAAGTGCCATGAAGTTTCAGGGCGATCTTGCATCACC 

ATGGGGAGAGTGATACTTGATGGGGCCAAGTGGGATGATGACTGTAACACCTGCCAGTGCCTG 

AATGGACGGGTGGCCTGCTCCAAGGTCTGGTGTGGCCCGAGACCTTGCAGGCTCCACAAAAGC 

CACAATGAGTGCCCCAGTGGGCAGAGCTGCATCCCGGTCCTGGATGACCAGTGTTTCGTGCGCC 

CCTGCACTGGTGTTGGCGAATGTCGGTCCTCCAGCCTCCAGCCAGTGAAGACCAAGTGCACATC 

TGACTCCTATTACCAGGATAACTGTGCAAACATCACTTTCACCrTTAACAAAGAGATGATGTCT 

CCAGGTCTTACCACCGAACACATTTGCAGCGAATTGAGGAATTTGAATATCCTGAAGAATGTTT 

CTGCTGAATATTCGATCTACATAGCCTGTGAGCCTTCCCTGTCAGCAAACAATGAAATACACGT 

GGCCATCTCTGCAGAAGACATCCGGGATGATGGGAACCCTGTCAAGGAAATTACCGATAAAAT 

AATAGATCTCGTTAGTAAACGGGATGGAAACAGCTCACTTATTGCTGCGGTTGCAGAAGTCAG 

AGTTCAGAGGCGTCCTCTGAAAAACAGAACAGATTrCCTGGTTCCTCTGCTGAGCTCTGTCITA 

ACAGTGGCITGGGTCTGTTGCTTGGTGACAGCCTTCTACTGGTGTGTACGGAAGCGGCGGAAGC 

CCAGCAGCCACACTCACTCCGCCCCCGAGGACAACACCACCAACAATGTGCGGGAGCAGCTGA 

ACCAAATCAAAAACCCCATCGAGAAACACGGAGCCAACACGGTCCCCATTAAGGATTACGAGA 

ACAAAAACTCCAAAATGTCAAAAATCAGGACACACAACTCGGAAGTGGAGGAGGATGACATG 

GATAAACACCAGCAGAAAGTCCGCTTTGCCAAACAGCCAGTGTATACGCTGGTAGACAGAGAG 

GAGAAGGCCCCCAGCGGCACGCCGACAAAACACCCGAACTGGACAAATAAACAGGACAACAG 
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AGACTTGGAAAGTGCCCAGAGCTTGAACCGGATGGAATACATC 
CCGCCATAGGTAGAGTTTGAGGGCACCGCGGGCCG 



Figure 5 

CTGCGGCCGGCCCGCGAGCTAGGCTGGGTTITTTTTTT^ 

TGATCTAAA^GGGAATAAAAGGCTGCGCATAATCATAATAATAAAAGAAGGGGAGCGCGAGAGAAGGA 

GAAAGCCGGGAGGTGGAAGAGGAGGGGGAGCGTCTCAAAGAAGCGATCAGAATAATAAAAGGAGGCGG 

CTCTTTGCCTTCTGGAACGGGCCGCTCITGAAAGGGCTTTT 

TGCTCCAATCGGCGGAGTATATTAGAGCCGGGACGCGGCGGCCGCAGGGGCAGCGGCGACGGCAGCACG 
GCGGCAGCACCAGCGCGAACAGCAGCGGCGGCGTCCCGAGTGCCCGCGGCGCGCGGCGCAGCGATGCGT 
CCCCACGGACGCGCGGCCGGTCCGGGCGCCCCCTAAGCCTCCTGCTCGCCCTGCTCTGTGCCCT 

CAAGGTGTGTGGGGCCTCGGGTCAGTTCGAGTTGGAGATCCTGTCCATGCAGAACGTGAACGGGGAGCTG 
CAGAACGGGAACTGCTGCGGCGGCGCCCGGAACCCGGGAGACCGCAAGTGCACCCGCGACGAGTGTGAA 
CATACTTCAAAGTGTGCCTCAAGGAGTATCAGTCCCGCGTCACGGCCGGGGGGCCCTGCAGCTTCGG 
AGGGTCCACGCCTGTCATCGGGGGCAACACCTTCAACCTCAAGGCCAGCCGCGGCAACGACCGCAACCC 

ATCGTGCTGCCTTTCAGTTTCGCCT^ 

ATGACACCGTTCAACCTGACAGTATTATTGAAAAGGCITCTCACTCGGG 

GTGGCAGACGCTGAAGCAGAACACGGGCGTTGCCCACTTTGAGTATCAGATCCGCGTGACCTGTGATGAC 
TACTACTATGGCnTTGGCTGCAATAAGTTCTGCCGCCCCAGAGATGACT^ 

ACCAGAATGGCAACAAAACTTGCATGGAAGGCTGGATGGGCCCCGAATGTAACAGAGCTATTTGCCGAA 
AGGCTGCAGTCCTAAGCATGGGTCITGCAAACTCCCAGGTGACTGCAGGTGCCAGTATGGCT 
CTGTACTGTGATAAGTGCATCCCACACCCGGGATGCGTCCACGGCATCTGTAATGAGCCCTGGCAGTGCC 
TCTGTGAGACCAACTGGGGCGGCCAGCTCTGTGACAAAGATCT 

TCTCAACGGGGGAACTTGTAGCAACACAGGCCCTGACAAATATCAGTGTTCCTGCCCTGAGGGGTATTCA 
GGACCCAACTGTGAAATTGCTGAGCACGCCTGCCTCTCTGATCCCTGTCACAACAGAGGCAGCTGTAAGG 
AGACCTCCCTGGGCTTTGAGTGTGAGTGTTCCCCAGGCTGGACCGGCCCCACATGCTCTACAAACATT . 
TGACTGTTCTCCTAATAACTGTTCCCACGGGGGCACCTGCCAGGACCTGGTTAA 

TGCCCCCCACAGTGGACTGGGAAAACGTGCCAGTTAGATGCAAATGAATGTGAGGCCAAACCTTGTGTAA 

ACGCCAAATCCTGTAAGAATCTCATC^ 

TTGTGACATAAATATTAATGACTGCCTTGGCCAGTGTC^ 

GGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCACTGTGAGAGAGACATCGATGAATGTGCCA 

GCAACCCCTGTTTGAATGGGGGTCACTGTCAGAATGAAATCAACAGATTCCAGTGTCTGTGTCCCACT 

TTTCTCTGGAAACCTCTGTCAGCTGGACATCGATTATTGTGAGCCTAATCCCTG 

TGCTACAACCGTGCCAGTGACTATTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACC 

TGAAAGACCACTGCCGCACGACCCCCTGTGAAGTGATTGACAGCTGCACAGTGGCCATGGCTTCCAACGA 

CACACCTGAAGGGGTGCGGTATATTTCCTCCAACGTCTGTGGTCCTCACGGGAAGTGCAAGAGTCAGTCG 

GGAGGCAAATTCACCTGTGACTGTAACAAAGGCTTCACGGGAACATACTGCCATGAA^ 

GTGAGAGCAACCCITGTAGAAACGGTGGCACTTGCATCGATGGTGTCAACTCCTACAAGTG 

TGACGGCTGGGAGGGGGCCTACTGTGAAACCAATATTAATGACTGCAGCCAGAACCCCTGCCACAATGG 

GGCACGTGTCGCGACCTGGTCAATGACTTCTACTGTGACTGTAAAAATGGGTGGA^ 

ACTCACGTGACAGTCAGTGTGATGAGGCCACGTGCAACAACGGTGGCACCTGCTATGATGAGGGGGATC 

TITrAAGTGCATGTGTCCTGGCGGCTGGGAAGGAACAACCTGTAACATAGCCCGAAACAGTAGCT 

CCCAACCCCTGCCATAATGGGGGCACATGTGTGGTCAACGGCGAGTCCTITACGTGCGTCTGCAAGGAAG 

GCTGGGAGGGGCCCATCTGTGCrCAGAATACCAATGACTGCAGCCCTCATCCCTGTTACAA 

CTGTGTGGATGGAGACAACTGGTACCGGTGCGAATGTGCCCCGGGTTTTGCTGGGCCCGACTGCAGAATA 

AACATCAATGAATGCCAGTCITCACCITGTGCCTTTGGAGCGAOT 

GGTGTGTCTGCCCTCCAGGGCACAGTGGTGCCAAGTGCCAGGAAGTTTCAGGGAGACCTTGCATCACCAT 

GGGGAGTGTGATACCAGATGGGGCCAAATGGGATGATGACTGTAATACCTGCCAGTGCCTGAATGGACG 

ATCGCCTGCTCAAAGGTCTGGTGTGGCCCTX^ 

GCGGGCAGAGCTGCATCCCCATCCTGGACGACCAGTC 

TCGGTCTTCCAGTCTCCAGCCGGTGAAGACAAAGTGCACCTCT 

AACATCACATTTACCITrAACAAGGAGATGATGTCACCAGGTCIT 

TGAGGAATTTGAATATTTTGAAGAATGTTTCCGCTGAATATTC 

TTCAGCGAACAATGAAATACATGTGGCCATTTCTGCTGAAGATATACGGGATGATGGGAACCCGATCAAG 

GAAATCACTGACAAAATAATCGATCTTGTTAGTAAAC^ 

CAGAAGTAAGAGTTCAGAGGCGGCCTCTGAAGAACAGAACAGATCT 

CITAACrGTGGCITGG^ 

GGCAGCCACACACACTCAGCCTCTGAGGACAACACCACCAACAACGTGCGGGAGCAGCTGAACCAGATA 
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AAAACCCCATTGAGAAACATGGGGCCAACACGGTC^ 
GTCTAAAATAAGGACACACAATTCTGAAGTAGAAGAGGACGACATGGAC^ 
GTITGCCAAGCAGCCGGCGTATACGCTGGTAGACAGAGAAGAGAAGCCCCCCAACGGCACGC^ 

. j. * /-vt-o/-.»/-,a a aoa a a r> a nn ArA4rAr,Af?A rTTOfr A A AGTGCCCAGAGCTTAAACCGAATGG AG^ 




C ATCGTATAGCAGACCGCGGGCACi CiCUUUUUU 1 auu i auau i ^ i u/iuuu^ i luln ^' * * . 7^-™-, 
CGTGTCATACTCGAGTCTGAGGCCGTrGCTGACTTAGAATCCCTGTGTTAATTTAAGTTTTGACAAGCT 
GCTTACACTGGCAATGGTAGTTTCTGTGGTTGGCTGGGAAATCGAGTGCCGCATCTCACAGCTATGCAAA 
AAGCTAGTCAACAGTACCCTGGTTGTGTGTCCCCTTGCAGCCGACACGGTCTCGGATCAGGCIGCCAG 
GCCTGCCCAGCCCCCTGGTCTITGAGCTCCCACTTCTGCCAGATGTCCrAATGGTGATGCAGTCTTAGAT 

CATAGTTTTATTTATATTTATTGACTCTTGAGTTGT^ 

GTTCTGTATITGAAAGTGCCrmGCAGCTCAGAACCACAGCAACGATCACAAATGACTTTATTATTTA 
TTTTTAATTGTATTTTTGTTGTTGGGG 

TTTAAAGAAAAAAATGTCAAAAGTAGAACTTTGTATAGTTATGTAAATAATTCTTIT^ 

TGTATATTTGATTTATTAACTITAATAATCAAGAGCCTrAAAACATCATTCCTTTTT 

GTTTAGAATTGAAGGTTTTTGATAGCATTGTAAGCGTATGGCTTTATTTTT^ 

TGTTGCCTATAAGCCAAAATTAAGGTGTTTGAAAATAGTTTATTTTAAAACAATAGGATGGGCTTCTGTG 

CCCAGAATACTGATGGAATTTTTTTTGTACGACGTCAGATGTTTAAAACACCTTCT^ 

AACACGTTTTAAGGACTGACTGAGGCAGTTTGAGGATTAGTTTAGAACAGGTTTTTTTGTT^ 

TTrGTTTTTCTGCTTTAGACTTGAAAAGAGACAGGCAGGTGATCTGCTGCAGAGCAGTAAGGGAACAAGT 

TGAGCTATGACTTAACATAGCCAAAATGTGAGTGGTTGAATATGATTAAAAATATCAAATTAATTGTGTG 

AACTTGGAAGCACACCAATCTGACTTTGTAAATTCTGATTTCTTTTCACCATTCGTACATAATACTGAAC 

CACTTGTAGATITGATTTTTTTTTrAATCTACTGCATTTAGGGAGTATTCTAATA^ 

TGAACCATAAAATGTCCAGTAAGATCACTGTTTAGATTTGCCATAGAGTACACTGCCTGCCTTAAGTGAG 
GAAATCAAAGTGCTATTACGAAGTTCAAGATCAAAAAGGCITATAAAACAGAGTAATCTTGTTGGTTCAC 
CATTGAGACCGTGAAGATACTTTGTATTGTCCTATTAGTGTTATATGAACATACAAATGCATCTTTGATG 
TGTTGTTCTTGGCAATAAATTTTGAAAAGTAATATTTATTAAATTTT^ 

GTGGCTCTTCTGAGCITACGTAGTTCTACCGGCTTrGCCGTGTGCTTCTGCCACCCrGCTGAGTCTGTTC 
TGGTAATCGGGGTATAATAGGCTCTGCCTGACAGAGGGATGGAGGAAGAACTGAAAGGCTTTTCAACCC 
AAAACTCATCTGGAGTTCTCAAAGACCTGGGGCTGCTGTGAAGCTGGAACTGCGGGAGCCCCATCTAGGG 
GAGCCTTGATTCCCTrGTTATTCAACAGCAAGTGTGAATACTGCTTGAATAAACACCACTGGATTAATGG 

AAAAAAAAAAAAAAAA 



Figure 6 

GGAGCGGGCGCGCGGCGGCGGCGGGGCCGCGGCGGGCGGGTCGCGGGGGCAATGCGGGCGCAGGGCCG 

GGGCCTTCCCCCCGGCGCTGCTGCTGCTGCTGGCGCTCTGGGTGCAGGCGGCGCGGCCCATGGGCTATTT 

CGAGCTGCAGCTGAGCGCGCTGCGGAACGTGAACGGGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGC 

GGCCGGACAACGCGCGCGGGGGGCTGCGGCCACGACGAGTGCGACACGTACGTGCGCGTGTGCCTTAAG 

AGTACCAGGCCAAGGTGACGCCCACGGGGCCCTGCAGCTACGGCCACGGCGCCACGCCCGTGCTGGGCG 

CAACTCCTTCTACCTGCCGCCGGCGGGCGCTGCGGGGGACCGAGCGCGCGCGCGGCCCCGGGCCGGCGC 

GACCAGGACCCGGGCTrCGTCGTCATCCCCTTCCAGTTCGCCTGGCCGCGCTCCTTTACCCTCATCGTGG 

AGGCCTGGGACTGGGACAACGATACCACCCCGAATGAGGAGCTGCTGATCGAGCGAGTGTCGCATGCCG 

CATGATCAACCCGGAGGACCGCTGGAAGAGCCTGCACTTCAGCGGCCACGTGGCGCACCTGGAGCTGCG 

ATCCGCGTGCGCTGCGACGAGAACTACTACAGCGCCACTTGCAACAAGTTCTGCCGGCCCCGCAACGACT 

TTTTCGGCCACTACACCTGCGACCAGTACGGCAACAAGGCCTGCATGGACGGCTGGATGGGCAAGGAGTG 

CAAGGAAGCTGTGTGTAAACAAGGGTGTAATTTGCTCCACGGGGGATGCACCGTGCCTGGGGAGTGCAG 

TGCAGCTACGGCTGGCAAGGGAGGTTCTGCGATGAGTGTGTCCCCTACCCCGGCTGCGTGCATGGCAGTT 

GTGTGGAGCCCTGGCAGTGCAACTGTGAGACCAACTGGGGCGGCCTGCTCTGTGACAAAGACCTGAACTA 

CTGTGGCAGCCACCACCCCTGCACCAACGGAGGCACGTGCATCAACGCCGAGCCTGACCAGTACCGCTGC 

ACCTGCCCTGACGGCTACTCGGGCAGGAACTGTGAGAAGGCTGAGCACGCCTGCACCTCCAACCCGTGTG 

CCAACGGGGGCTCTTGCCATGAGGTGCCGTCCGGCTTCGAATGCCACTGCCCATCGGGCTGGAGCGGGCC 

CACCTGTGCCCTTGACATCGATGAGTGTGCTTCGAACCCGTGTGCGGCCGGTGGCACCTGTGTGGACCAG 

GTGGACGGCTTTGAGTGCATCTGCCCCGAGCAGTGGGTGGGGGCCACCTGCCAGCTGGACGTCAACGACT 

GTGAAGGGAAGCCATGCCTTAACGCTTTTTCTTGCAAAAACCTGATrGGCGGCTATTACTGTGATTGCAT 

CCCGGGCTGGAAGGGCATCAACTGCCATATCAACGTCAACGACTGTCGCGGGCAGTGTCAGCATGGGGC 

ACCTGCAAGGACCTGGTGAACGGGTACCAGTGTGTGTGCCCACGGGGCTTCGGAGGCCGGCATTGCGAGC 

TGGAACGAGACAAGTGTGCCAGCAGCCCCTGCCACAGCGGCGGCCTCTGCGAGGACCTGGCCGACGGCT 

CCACTGCCACTGCCCCCAGGGCTTCTCCGGGCCTCTCTGTGAGGTGGATGTCGACCTTTGTGAGCCAAGC 

CCCTGCCGGAACGGCGCTCGCTGCTATAACCTGGAGGGTGACTATTACTGCGCCTGCCCTGATGACTTTG 

GTGGCAAGAACTGCTCCGTGCCCCGCGAGCCGTGCCCTGGCGGGGCCTGCAGAGTGATCGATGGCTGCGG 
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GTCAGACGCGGGGCCTGGGATGCCTGGCACAGCAGCCTCCGGCGTGTGTGGCCCCCATGGACGCTGCGTC 

AGCCAGCCAGGGGGCAACTTTTCCrGCATCTGTGACAGTGGCTTTACTGGCACCTACTGCCATGAGAACA 

TTGACGACTGCCTGGGCCAGCCCTGCCGCAATGGGGGCACATGCATCGATGAGGTGGACGCCTTCCGCTG 

CTTCTGCCCCAGCGGCTGGGAGGGCGAGCTCTGCGACACCAATCCCAACGACTGCCTTCCCGATCCCTGC 

CACAGCCGCGGCCGCTGCTACGACCTGGTCAATGACTTCTACTGTGCGTGCGACGACGGCTGGAAGGGCA 

AGACCTGCCACTCACGCGAGTTCCAGTGCGATGCCTACACCTGCAGCAACGGTGGCACCTGCTACGACAG 

CGGCGACACCTTCCGCTGCGCCTGCCCCCCCGGCTGGAAGGGCAGCACCTGCGCCGTCGCCAAGAACAGC 

AGCTGCCTGCCCAACCCCTGTGTGAATGGTGGCACCTGCGTGGGCAGCGGGGCCTCCTTCTCCTGCATCT 

GCCGGGACGGCTGGGAGGGTCGTACTTGCACTCACAATACCAACGACTGCAACCCTCTGCCTTGCTACAA 

TGGTGGCATCTGTGTTGACGGCGTCAACTGGTTCCGCTGCGAGTGTGCACCTGGCTTCGCGGGGCCTGAC 

TGCCGCATCAACATCGACGAGTGCCAGTCCTCGCCCTGTGCCTACGGGGCCACGTGTGTGGATGAGATCA 

ACGGGTATCGCTGTAGCTGCCCACCCGGCCGAGCCGGCCCCCGGTGCCAGGAAGTGATCGGGTTCGGGAG 

ATCCTGCTGGTCCCGGGGCACTCCGTTCCCACACGGAAGCTCCTGGGTGGAAGACTGCAACAGCTGCCGC 

TGCCTGGATGGCCGCCGTGACTGCAGCAAGGTGTGGTGCGGATGGAAGCCTTGTCTGCTGGCCGGCCAGC 

CCGAGGCCCTGAGCGCCCAGTGCCCACTGGGGCAAAGGTGCCTGGAGAAGGCCCCAGGCCAGTGTCTGG 

ACCACCCTGTGAGGCCTGGGGGGAGTGCGGCGCAGAAGAGCCACCGAGCACCCCCTGCCTGCCACGCTC 

GGCCACCTGGACAATAACTGTGCCCGCCTCACCTTGCATTTCAACCGTGACCACGTGCCCCAGGGCACCA 

CGGTGGGCGCCATTTGCTCCGGGATCCGCTCCCTGCCAGCCACAAGGGCTGTGGCACGGGACCGCCTGCT 

GGTGTTGCTTTGCGACCGGGCGTCCTCGGGGGCCAGTGCCGTGGAGGTGGCCGTGTCCTTCAGCCCTGCC 

AGGGACCTGCCTGACAGCAGCCTGATCCAGGGCGCGGCCCACGCCATCGTGGCCGCCATCACCCAGCGG 

GGAACAGCTCACTGCTCCTGGCTGTCACCGAGGTCAAGGTGGAGACGGTTGTTACGGGCGGCTCTTCCAC 

AGGTCTGCTGGTGCCTGTGCTGTGTGGTGCCTTCAGCGTGCTGTGGCTGGCGTGCGTGGTCCTGTGCGTG 

TGGTGGACACGCAAGCGCAGGAAAGAGCGGGAGAGGAGCCGGCTGCCGCGGGAGGAGAGCGCCAACAC 

AGTGGGCCCCGCTCAACCCCATCCGCAACCCCATCGAGCGGCCGGGGGGCCACAAGGACGTGCTCTACCA 

GTGCAAGAACTTCACGCCGCCGCCGCGCAGGGCGGACGAGGCGCTGCCCGGGCCGGCCGGCCACGCGGC 

GTCAGGGAGGATGAGGAGGACGAGGATCTGGGCCGCGGTGAGGAGGACTCCCTGGAGGCGGAGAAGTTC 

TCTCACACAAATTCACCAAAGATCCTGGCCGCTCGCCGGGGAGGCCGGCCCACTGGGCCTCAGGCCCCAA 

AGTGGACAACCGCGCGGTCAGGAGCATCAATGAGGCCCGCTACGCCGGCAAGGAGTAGGGGCGGCTGCG 

CTGGGCCGGGACCCAGGGCCCTCGGTGGGAGCCATGCCGTCTGCCGGACCC GGAG CCGAGGCATGTGCT 

AGTTTCTTTATTTTGTGTAAAAAAACCACCAAAAACAAAAACCAAATGTTTATm 

CCTTGTATAAATTATTCAGTAACTGTCAGGCTGAAAACAATC 

TAAAGTTTCCGTGCGTGGCACTCGCTGTATGAAAGGAGAGAGCAAAGGGTGTCTGCGTCGTCACCAAATC 

GTAGCGTTTGTTACCAGAGGTTGTGCACTGTTTACAGAATCrTCCrmTATTCCTCA 

GTGGCTCCAGGCCAAAGTGCCGGTGAGACCCATGGCTGTGTTGGTGTGGCCCATGGCTGTTGGTGGGACC 

CGTGGCTGATGGTGTGGCCTGTGGCTGTCGGTGGGACTCGTGGCTGTCAATGGGACCTGTGGCTGTCGGT 

GGGACCTACGGTGGTCGGTGGGACCCTGGTTATTGATGTGGCCCTGGCTGCCGGCACGGCCCGTGGCTGT 

TGACGCACCTGTGGTTGTTAGTGGGGCCTGAGGTCATCGGCGTGCCCAAGGCCGGCAGGTCAACCTCGCG 

CTTGCTGGCCAGTCCACCCTGCCTGCCGTCTGTGCTTCCTCCTGCCCAGAACGCCCGCTCCAGCGATCTC 

TCCACTGTGCTTTCAGAAGTGCCCTTCCTGCTGCGCAGTTCTCCCATCCTGGGACGGCGGCAGTATTGAA 

GCTCGTGACAAGTGCCTTCACACAGACCCCTCGCAACTGTCCACGCGTGCCGTGGCACCAGGCGCTGCCC 

ACCTGCCGGCCCCGGCCGCCCCTCCTCGTGAAAGTGCATTTTTGTAAATGTGTACATATTAAAGGAAGCA 
CTCTGT AT ATTTG ATTG AAT AATGCC ACC AAAAAAAAAAAAAA AAAAAAATTCCTG CCC 



Figure 7 

TCGAGGCGGCGATGCGGGCACGCGGCTGGGGACGCCTGCCTCGGCGGCTGCTGCTGCTACTGG 

TTCTGTGCGTGCAGGCGACGCGGCCCATGGGCTATTTCGAGCTGCAGCTGAGCGCGCTGCGGAA 

CGTGAACGGGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGACGGCCGGACGACGCGCGCGGG 

GGGCTGCGGCCGCGACGAGTGCGACACGTACGTGCGCGTGTGCCTTAAGGAGTACCAGGCCAA 

GGTGACGCCCACGGGGCCCTGCAGCTACGGCTACGGCGCCACGCCCGTGCTGGGTGGCAACTC 

CTTCTACCTGCCGCCGGCGGGCGCTGCGGGGGACCGAGCGCGCGCGCGGTCTCGGACCGGCGG 

CCACCAGGACCCGGGCCTCGTCGTCATTCCCITTCAGTTCGCCTGGCCGCGTTCTTTCACCCTCA 

TCGTGGAGGCCTGGGACTGGGACAATGACACCACTCCAGATGAGGAGCTGCTGATTGAGCGGG 

TGTCGCACGCTGGCATGATCAACCCCGAGGACCGCTGGAAGAGCCTGCACTTCAGCGGCCACG 

TGGCACACCTGGAGCTGCAGATCCGAGTGCGCTGTGATGAGAACTACTACAGTGCCACCTGCA 

ACAAGTTCTGCCGGCCCCGCAACGACTTCTTTGGCCACTATACCTGCGACCAGTACGGCAACAA 

GGCCTGCATGGATGGCTGGATGGGCAAAGAATGCAAAGAAGCCGTGTGTAAACAAGGATGTAA 

TTTGCTCCACGGGGGATGCACTGTGCCTGGGGAGTGCAGGTGCAGCTACGGCTGGCAGGGCAA 
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GTTCTGTGACGAGTGTGTCCCCTACCCTGGCTGCGTGCATGGCAGCTGTGTGGAGCCCTGGCAC 

TGTGACTGTGAGACCAACTGGGGTGGCCTGCTCTGCGACAAAGACCTGAACTACTGTGGCAGC 

CACCACCCCTGTGTCAACGGGGGTACCTGCATCAATGCTGAGCCTGACCAATACCTCTGCGCCT 

GCCCAGATGGCTACTTGGGCAAGAACTGTGAGCGGGCTGAGCACGCCTGTGCCTCCAACCCGT 

GTGCCAATGGGGGCTCTTGCCACGAAGTGCCATCTGGCTTTGAATGCCACTGTCCGTCAGGATG 

GAGCGGACCCACCTGTGCGCTCGACATTGATGAGTGTGCCTCTAACCCATGTGCAGCGGGTGGT 

ACCTGCGTGGATCAGGTGGACGGCTTCGAGTGCATCTGCCCGGAGCAGTGGGTGGGGGCTACT 

TGCCAGCTGGACGCCAATGAGTGTGAAGGGAAGCCGTGCCTTAATGCTITITCTTGCAAAAACC 

TGATTGGCGGCTATTACTGTGATTGCCTCCCGGGCTGGAAGGGCATCAACTGCCAAATCAACAT 

CAACGATTGTCATGGGCAGTGTCAGCATGGGGGCACCTGCAAGGACCTGGTCAATGGGTACCA 

GTGTGTGTGCCCGCGGGGCTTTGGAGGTCGCCATTGCGAACTAGAGTACGACAAGTGTGCCAG 

CAGCCCCTGCCGCCGGGGTGGCATCTGCGAGGACCTGGTGGATGGCTTCCGCTGCCACTGCCCA 

CGGGGCCTCTCTGGGCTGCACTGTGAGGTGGACATGGATCTCTGTGAACCAAGCCCCTGCCTCA 

ACGGTGCTCGCTGCTACAACCITGAGGGTGACTACTACTGCGCCTGCCCAGAAGACTTTGGTGG 

CAAGAACTGCTCAGTGCCCAGGGACACATGCCCTGGCGGGGCATGTAGAGTGATCGATGGCTG 

CGGGTTCGAGGCAGGGTCCAGGGCACGCGGTGTCGCACCCTCTGGTATATGTGGCCCTCACGG 

GCACTGCGTTAGCCTGCCTGGGGGAAACTTCTCCTGCATCTGTGACAGCGGCTTCACAGGCACC 

TACTGCCATGAAAACATTGACGACTGCATGGGCCAGCCCTGCCGCAACGGGGGCACGTGCATT 

GACGAAGTGGACTCCTTCCGCTGCTTCTGCCCCAGTGGCTGGGAAGGAGAACTCTGTGACATCA 

ATCCCAACGACTGCCTCCCCGACCCCTGCCACAGCCGCGGCCGCTGCTATGACCTGGTCAATGA 

CTrCTACTGTGCCTGTGACGATGGCTGGAAGGGCAAGACCTGCCACTCACGCGAGTTCCAGTGT 

GACGCCTACACCTGCAGCAACGGTGGCACATGCTATGACAGCGGCGACACCTTCCGCTGCGCG 

TGCCCTCCGGGCTGGAAGGGCAGCACCTGCACCATCGCCAAGAACAGCAGCTGTGTGCCCAAT 

CCCTGTGTGAATGGAGGCACCTGCGTGGGTAGCGGAGACTCTTTCTCCTGCATCTGCCGGGATG 

GCTGGGAGGGCCGCACCTGCACACATAACACCAATGACTGCAACCCTCTGCCCTGCTATAACG 

GAGGCATCTGTGTTGATGGCGTCAACTGGTTCCGCTGCGAGTGTGCGCCTGGCTTTGCGGGTCC 

TGACTGCCGTATCAACATTGATGAGTGCCAGTCCTCGCCCTGTGCCTACGGAGCCACGTGTGTG 

GATGAGATCAACGGGTACCGCTGCAGCTGCCCACCAGGTCGTTCTGGCCCCAGGTGCCAGGAA 

GTGGTCATATTCACGAGGCCCTGCTGGTCCCGGGGAATGTCCTTCCCGCATGGGAGTTCCTGGA 

TGGAAGACTGCAACAGCTGCCGCTGCCTGGATGGCCACCGGGATTGTAGCAAGGTATGGTGCG 

GATGGAAGCCTTGCCTGCTCTCTGGTCAGCCCAGCGATCCGAGTGCCCAGTGCCCCCCAGGGCA 

GCAATGTCAGGAGAAGGCCGTGGGTCAGTGCTTGCAGCCACCCTGTGAGAACTGGGGGGAGTG 

TACAGCGGAGGAGCCTCTGCCACCCAGCACCCCCTGTCAGCCACGGAGCAGTCATTTGGACAA 

CAACTGTGCCCGACTCACACTGCGCTTCAACCGTGATCAAGTGCCTCAGGGCACCACCGTGGGC 

GCTATCTGCTCTGGAATCCGAGCCTTGCCTGCCACGAGGGCGGCGGCACACGACCGCCTCCTCC 

TGCTGCTTTGTGATCGAGCATCCTCGGGGGCCAGTGCTGTGGAGGTGGCTATGTCTTTCAGCCC 

TGCAAGGGACCTGCCTGACAGCAGCCTGATCCAGAGCACAGCCCACGCCATCGTGGCTGCTAT 

CACTCAGAGAGGAAATAGCTCACTGCTGCTGGCTGTCACCGAGGTCAAGGTGGAAACAGTTGT 

TATGGGTGGCTCTTCCACAGGTCTGTTGGTGCCCGTGCTGTGCAGCGTGTTCAGTGTGCTGTGGC 

TCGCCTGTGTGGTTATCTGCGTATGGTGGACACGAAAGCGCAGGAAAGAACGTGAGAGGAGCC 

GGCTACCACGGGATGAGAGCACCAACAACCAGTGGGCCCCGCTCAATCCCATCCGCAACCCCA 

TTGAGCGGCCAGGCGGCAGCGGTCTGGGAACTGGGGGCCACAAGGACATACTCTACCAGTGCA 

AAAACTTCACACCGCCGCCCCGCAGGGCAGGCGAGGCACTGCCCGGGCCAGCTGGCCATGGGG 

CTGGTGGGGAGGACGAGGAGGATGAAGAGCTGAGCCGTGGAGATGGGGACTCCCCAGAGGCA 

GAGAAGTTCATCTCACACAAGTTCACCAAAGACCCCAGCTGCTCCCTCGGAAGGCCAGCCTGCT 

GGGCTCCAGGGCCCAAAGTGGACAACCGCGCCGTCAGAAGTACCAAGGACGTGCGCCGTGCTG 

GCAGGGAGTAGCCAGCCACCAGGCTGGCACCCAGAACCCTTGCTGGCACCACGCTGCCTGCCG 

GACCATAGGAGGCCAAGGCCGTGTGCATAGTTTCTITATTTTGTGTAAAAAACAAAACCAAAAC 

CAAAAAACAAATGTITATTTTTTACGTTTCTITAACCT^ 

CGGAAAACAACGGAGTATTCTCGGATCATTGCTATmTGTAAAGTTTCCGCGTCCGCACGCAC 
TGTGGCAGGAGAGCAGGGCGTGTGTATGTGTGTGTGTGTGTGTCCTCACC 

Figure 8 

GAAGGCCATGGTCTCCCCACGGATGTCCGGGCTCCTCTCCCAGACTGTGATCCTAGCGCTCATTTTCCT 

CCCCAGACACGGCCCGCTGGCGTCTTCGAGCTGCAGATCCACTCrTTCGGGCCGGGTCCAGGCCCTGGGG 

CCCCGCGGTCCCCCTGCAGCGCCCGGCTCCCCTGCCGCCrCTTCTTCAGAGTCTGCCTGAAGCCTGGGCT 
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CTCAGAGGAGGCCGCCGAGTCCCCGTGCGCCCTGGGCGCGGCGCTGAGTGCGCGCGGACCGGTCTACACC 

GAGCAGCCCGGAGCGCCCGCGCCTGATCTCCCACTGCCCGACGGGCrCITGCAGGTGCC 

CCTGGCCTGGCACCTTCTCnTCATCATCGAAACCTGGAGAGAGGAGTTAGGAGACCAGATTGGAGGGCC 

CGCCTGGAGCCTGCTGGCGCGCGTGGCTGGCAGGCGGCGCTTGGCAGCCGGAGGCCCGTGGGCCCGGGC 

ATTCAGCGCGCAGGCGCCTGGGAGCTGCGCTTCTCGTACCGCGCGCGCTGCGAGCCGCCTGCCGTCGGGA 

CCGCGTGCACGCGCCTCTGCCGTCCGCGCAGCGCCCCCTCGCGGTGCGGTCCGGGACTGCGCCCCTGCGC 

ACCGCTCGAGGACGAATGTGAGGCGCCGCTGGTGTGCCGAGCAGGCTGCAGCCCTGAGCATGGCTTCTGT 

GAACAGCCCGGTGAATGCCGATGCCTAGAGGGCTGGACTGGACCCCTCTGCACGGTCCCTGTCTCC^ 

GCAGCTGCCTCAGCCCCAGGGGCCCGTCCTCTGCTACCACCGGATGCCTTGTCCCTGGGCCTGGGCCCTG 

TGACGGGAACCCGTGTGCCAATGGAGGCAGCTGTAGTGAGACACCCAGGTCCrrTTGAATGCACCTGCCC^G 

CGTGGGTTCTACGGGCTGCGGTGTGAGGTGAGCGGGGTGACATGTGCAGATGGACCCTGCTTCAACGGCG 

GCTTGTGTGTCGGGGGTGCAGACCCTGACTCTGCCTACATCTGCCACTGCCCACCTGGTTTCCAAGGCTC 

CAACTGTGAGAAGAGGGTGGACCGGTGCAGCCTGCAGCCATGCCGCAATGGCGGACTCTGCCTGGACCG 

GGCCACGCCCTGCGCTGCCGCTGCCGCGCCGGCTTCGCGGGTCCTCGCTGCGAGCACGACCTGGACGACT 

GCGCGGGCCGCGCCTGCGCTAACGGCGGCACGTGTGTGGAGGGCGGCGGCGCGCACCGCTGCTCCTGCC 

GCTGGGCTTCGGCGGCCGCGACTGCCGCGAGCGCGCGGACCCGTGCGCCGCGCGCCCCTGTGCTCACGGC 

GGCCGCTGCTACGCCCACTTCTCCGGCCTCGTCTGCGCTTGCGCTCCCGGCTACATGGGAGCGCGGTGTG 

AGTTCCCAGTGCACCCCGACGGCGCAAGCGCCTTGCCCGCGGCCCCGCCGGGCCTCAGGCCCGGGGACCC 

TCAGCGCTACCTnTGCCTCCGGCTCTGGGACTGCTCGTGGCCGCGGGCGTGGCCGGCGGTGCGCTCTTG 

CTGGTCCACGTGCGCCGCCGTGGCCACTCCCAGGATGCTGGGTCTCGCTTGCTGGCTGGGACCCCGGAGC 

CGTCAGTCCACGCACTCCCGGATGCACTCAACAACCTAAGGACGCAGGAGGGTTCCGGGGATGGTCCGG 

CTCGTCCGTAGATTGGAATCGCCCTGAAGATGTAGACCCTCAAGGGATTTATGTCATATCTGCTCCTTCC 

ATCTACGCTCGGGAGGTAGCGACGCCCCTnTCCCCCCGCTACACACTGGGCGCGCTGGGCAGAGGCAGC 

ACCTGCTTTTTCCCTACCCTTCCTCGATTCTGTCCGTGAAATGAATTGGGTAGAGTCTCTGGAAGGTTTT 

AAGCCCATTITCAGTTCTAACTTACTTTCATCCTATTTTGCATCCCTCTTATCGTTTTGA 

ATCTTCTCTTT 
Figure 9 

AAACCGGAACGGGGCCCAACTTCTGGGGCCTGGAGAAGGGAAACGAAGTCCCCCCCGGTTTCCCGAGGT 

GCCTTTCCTCGGGCATCCTTGGTTTCGGCGGGACTTCGCAGGGCGGATATAAAGAACGGCGCCTTTGGGA 

AGAGGCGGAGACCGGCTTTAAAGAAAGAAGTCTTGGTCCTGCGGCTTGGGCGAGGCAAGGGCGAGGCAG 

GGCGCTTTCTGCCGACGCTCCCCGTGGCCCTACGATCCCCCGCGCGTCCGCCGCTGTTCTAAGGAGAGAA 

GTGGGGGCCCCCCAGGCTCGCGCGTGGAGCGAAGCAGCATGGGCAGTCGGTGCGCGCTGGCCCTGGCGT 

GCTCTCGGCCTTGCTGTGTCAGGTCTGGAGCTCTGGGGTGTTCGAACTGAAGCTGCAGGAGTTCGTCAAC 

AAGAAGGGGCTGCTGGGGAACCGCAACTGCTGCCGCGGGGGCGCGGGGCCACCGCCGTGCGCCTGCCGA 

CCTTCTTCCGCGTGTGCCTCAAGCACTACCAGGCCAGCGTGTCCCCCGAGCCGCCCTGCACCTACGGCAG 

CGCCGTCACCCCCGTGCTGGGCGTCGACTCCTTCAGTCTGCCCGACGGCGGGGGCGCCGACTCCGCGTTC 

AGCAACCCCATCCGCTTCCCCTTCGGCTrCACCTGGCCGGGCACCTTCTCTCTGATTATTGAAGCTCTCC 

ACACAGATTCTCCTGATGACCTCGCAACAGAAAACCCAGAAAGACTCATCAGCCGCCTGGCCACCCAGAG 

GCACCTGACGGTGGGCGAGGAGTGGTCCCAGGACCTGCACAGCAGCGGCCGCACGGACCTCAAGTACTC 

TACCGCTTCGTGTGTGACGAACACTACTACGGAGAGGGCTGCTCCGTTTTCTGCCGTCCCCGGGACGATG 

CCTTCGGCCACTTCACCTGTGGGGAGCGTGGGGAGAAAGTGTGCAACCCTGGCTGGAAAGGGCCCTACTG 

CACAGAGCCGATCTGCCTGCCTGGATGTGATGAGCAGCATGGATTTTGTGACAAACCAGGGGAATGCAAG 

TGCAGAGTGGGCTGGCAGGGCCGGTACTGTGACGAGTGTATCCGCTATCCAGGCTGTCTCCATGGCACCT 

GCCAGCAGCCCTGGCAGTGCAACTGCCAGGAAGGCTGGGGGGGCCITITCTGCAACCAGGACCTGAACTA 

CTGCACACACCATAAGCCCTGCAAGAATGGAGCCACCTGCACCAACACGGGCCAGGGGAGCTACACTTC 

TCTTGCCGGCCTGGGTACACAGGTGCCACCTGCGAGCTGGGGATTGACGAGTGTGACCCCAGCCCTTGTA 

AGAACGGAGGGAGCTGCACGGATCTCGAGAACAGCTACTCCTGTACCTGCCCACCCGGCTTCTACGGCAA 

AATCTGTGAATTGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTCGGTGCTCAGACAGC 

CCCGATGGAGGGTACAGCTGCCGCTGCCCCGTGGGCTACTCCGGCTTCAACTGTGAGAAGAAAATTGACT 

ACTGCAGCTCTTCACCCTGTTCTAATGGTGCCAAGTGTGTGGACCTCGGTGATGCCTACCTGTGCCGCTG 

CCAGGCCGGCTTCTCGGGGAGGCACTGTGACGACAACGTGGACGACTGCGCCTCCTCCCCGTGCGCCAAC 

GGGGGCACCTGCCGGGATGGCGTGAACGACTTCTCCTGCACCTGCCCGCCTGGCTACACGGGCAGGAACT 

GCAGTGCCCCCGTCAGCAGGTGCGAGCACGCACCCTGCCACAATGGGGCCACCTGCCACCAGAGGGGCA 

CGGCTATGTGTGCGAATGTGCCCGAAGCTACGGGGGTCCCAACTGCCAGTTCCTGCTCCCCGAGCTGCCC 

CCGGGCCCAGCGGTGGTGGACCTCACTGAGAAGCTAGAGGGCCAGGGCGGGCCATTCCCCTGGGTGGCG 

TGTGCGCCGGGGTCATCCTTGTCCTCATGCTGCTGCTGGGCTGTGCCGCTGTGGTGGTCTGCGTCCGGCT 

GAGGCTGCAGAAGCACCGGCCCCCAGCCGACCCCTGCCGGGGGGAGACGGAGACCATGAACAACCTGGC 

AACTGCCAGCGTGAGAAGGACATCTCAGTCAGCATCATCGGGGCCACGCAGATCAAGAACACCAACAAA 

AGGCGGACTTCCACGGGGACCACAGCGCCGACAAGAATGGCTTCAAGGCCCGCTACCCAGCGGTGGACA 

TAACCTCGTGCAGGACCTCAAGGGTGACGACACCGCCGTCAGGGACGCGCACAGCAAGCGTGACACCAG 



WO 03/012082 PCT/GB02/03409 

11/41 

TGCCAGCCCCAGGGCTCCTCAGGGGAGGAGAAGGGGACCCCGACCACACTCAGGGGTGGAGAAGCATCG 
AAAGAAAAAGGCCGGACTCGGGCTGTTCAACnTCAAAA 

CGAGGAGAAGGATGAGTGCGTCATAGCAACTGAGGTGTAAAATGGAAGTGAGATGGCAAGACTCCCGTT 

CTCTTAAAATAAGTAAAATTCCAAGGATATATGCCCCAACGAATGCTGCrrGAAGAGGA 

GGACTGCTGCTGAGAAACCGAGTTCAGACCGAGCAGGTTCTCCTCCTGAGGTCCTCGACGCCTGCCGA 

GCCTGTCGCGGCCCGGCCGCCTGCGGCACTGCCITCCGTGACGTCGCCGTTGCACTATGGACAGTTGCT 

TTAAGAGAATATATATTTAAATGGGTGA^ 

TTTGGATTCTrATGAGCCAGTCITITCTTGAATTAGAAACACAAACACT 

ACGAAGATGTGCTTTTTCTAGATGGAAAAGATGTGTGTTATTTTT^ 

ATATCTGTAAAGCTTGAGTATTTTGTGATGTTC 

GGCACTTCGGGTCTATGTGACTATATTTTTTTGTATATAAATGTA 

TTTGAGTTTTTTACTGTTTTGTTAATGA^ 

AGGAATTC 



Figure 10 

ATGGCGGCAGCGTCCCGGAGCGCCTCTGGCTGGGCGCH^CTGCTO 

CGGCCGGCTCCGGCGTCTTCCAGCTGCAGCTGCAGGAGTTCATCAACGAGCGCGGCGTACTGGCCAGTGG 
GCGGCCTTGCGAGCCCGGCTGCCGGACITTCTITCCGCGTCTGCCT 

CCCGGACCCTGCACCTrCGGGACCGTCTCCACGCCGGTATTGGGCACCAACTCCTTCGCTGTCCGGGACG 

ACAGTAGCGGCGGGGGGCGCAACCCTCTCCAACTGCCCTTCAATTTCACCTGGCCGGGTACCTTCT 

CATCATCGAAGCTTGGCACGCGCCAGGAGACGACCTGCGGCCAGAGGCCTTGCCACCAGATGCACTCATC 

AGCAAGATCGCCATCCAGGGCTCCCTAGCTGTGGGTCAGAACTGGTTATTGGATGAGCAAACCAGCACCC 

TCACAAGGCTGCGCTACTCTTACCGGGTCATCTGCAGTGACAACTACTATGGAGACAACTGCTCCCGCCT 

GTGCAAGAAGCGCAATGACCACTTCGGCCACTATGTGTGCCAGCCAGATGGCAACTTGTCCTGCCTGCCC 

GGTTGGACTGGGGAATATTGCCAACAGCCTATCTGTCTTTCGGGCTGTCATGAACAGAATG 

GCAAGCCAGCAGAGTGCCTCTGCCGCCCAGGCTGGCAGGGCCGGCTGTGTAACGAATGCATCCCCCACAA 

TGGCTGTCGCCACGGCACCTGCAGCACTCCCTGGCAATGTACTTGTGATGAGGGCTGGGGAGGCCTGTTT 

TGTGACCAAGATCTCAACTACTGCACCCACCACTCCCCATGCAAGAATGGGGCAACGTGCTCCAACAGTG 

GGCAGCGAAGCTACACCTGCACCTGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGCTCAGCGA 

GTGTGACAGCAACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCTACCACTGCCTGTGT 

CCTCCGGGCTACTATGGCCTGCATTGTGAACACAGCAC 

GGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCITGTGAATGTCCCCCCAACrTCACCGGCTC 

CAACTGCGAGAAGAAAGTGGACAGGTGCACCAGCAACCCCTGTGCCAACGGGGGACAGTGCCTGAACCA 

GGTCCAAGCCGCATGTGCCGCTGCCGTCCTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACT 

GTGCCCGTAACCCTTGCGCCCACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCC 

TGCCGGCTTCTCTGGCCGACGCTGTGAGGTGCGGACATCCATCGATGCCT 

AACAGGGCCACCTGCTACACCGACCTCTCCACAGACACCITrGTCT 

GCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAGCirCCCCTGGGTGGCCGTCTCGCTG 
GCTGGCAGTGCTGCTGGTACTGCTGGGCATGGTGGC^ 

GACGACGGCAGCAGGGAAGCCATGAACAACTTGTCGGACTTCCAGAAGGACAACCTGATTCCT 

AGCTTAAAAACACAAACCAGAAGAAGGAGCTGGAAGTGGACTGTGGCCTGGACAAGTCCAACTGTGGCA 

ACAGCAAAACCACACATTGGACTATAATCTGGCCCCAGGGCCCCTGGGGCGGGGGACCATGCCAGGAAG 

TTTCCCCACAGTGACAAGAGCTTAGGAGAGAAGGCGCCACTGCGGTTACACAGTGAAAAGCCAGAGTGC 

GGATATCAGCGATATGCTCCCCCAGGGACTCCATGTACCAGTCTGTGTGTTTGATATCAGAGGAGAGGAA 

TGAATGTGTCATTGCCACGGAGGTATAA 



Figure 11 

CTCGCAGGCTAGGAACCCGAGGCCAAGAGCTGCAGCCAAAGTCACTTC 

GCCCGCTCGAGACCCTAGGATTTGCTCCAGGACACGTACTTAGAGCAGCCACCGCCCAGTCGCCCTCACC 

TGGATTACCTACCGAGGCATCGAGCAGCGGAGTTTTTGAGAAGGCGACAAGGGAGCAGCGTCCCGAGGG 

AATCAGCrTTTTCAGGAACTCGGCTGGCAGACGGGACTTGCGGGAGAGCGACATCCCTAACAAGCAGATTC 

GGAGTCCCGGAGTGGAGAGGACACCCCAAGGGATGACGCCTGCGTCCCGGAGCGCCTGTCGCTGGGCGT 

ACTGCTGCTGGCGGTACTGTGGCCGCAGCAGCGCGCTGCGGGCrCCGGCATCTTCCAGCTGCGGCTGCAG 

GAGTTCGTCAACCAGCGCGGTATGCTGGCCAATGGGCAGTCCTGCGAACCGGGCTGCCGGACnTrCTrCC 

GCATTTGCCITAAGCACTTCCAGGCAACOT 

GGTATTGGGCACCAACTCCTTCGTCGTCAGGGACAAGAATAGCGGCAGTGGTCGCAACCCTCTGCAGTTG 
CCCTITCAATTTCACCTGGCCGGGAACCTTCTCACTC 

TGCGGCCAGAGACTTCGCCAGGAAACTCTCTCATCAGCCAAATCATCATCCAAGGCTCTCTTGCTGTGGG 
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TAAGATTTGGCGAACAGACGAGCAAAATGACACCCTCACCAGACTGAGCTACTCTTACCGGGTCATCTGC 

AGTGACAACTACTATGGAGAGAGCTGTTCTCGCCTATGCAAGAAGCGCGATGACCACTTCGGACATTATG 

AGTGCCAGCCAGATGGCAGCCTGTCCrGCCTGCCGGGCTGGACTGGGAAGTACTGTGACCAGCCTATATG 

TCTTTCn'GGCTGTCATGAGCAGAATGGTTACTGCAG.CAAGCCAGATGAGTGCATCTGCCGTCCAGGTTGG 

CAGGGTCGCCTGTGCAATGAATGTATCCCCCACAATGGCTGTCGTCATGGCACCTGCAGCATCCCCTGGC 

AGTGTGCCTGCGATGAGGGATGGGGAGGTCTGTTTTGTGACCAAGATCTCAACTACTGTACTCACCACTC 

TCCGTGCAAGAATGGATCAACGTGTTCCAACAGTGGGCCAAAGGGTTATACCTGCACCTGTCTCCCAGGC 

TACACTGGTGAGCACTGTGAGCTGGGACTCAGCAAGTGTGCCAGCAACCCCTGTCGAAATGGTGGCAGCT 

GTAAGGACCAGGAGAATAGCTACCACTGCCTGTGTCCCCCAGGCTACTATGGCCAGCACTGTGAGCATAG 

TACCirGACCTGTGCGGACTCACCCTGCTTCAATGGGGGCTCTTGCCGGGAGCGCAACCAGGGGTCCAGT 

TATGCCTGCGAATGCCCCCCCAACTTTACCGGCTCTAACTGTGAGAAGAAAGTAGACAGGTGTACCAGCA 

ACCCGTGTGCCAATGGAGGCCAGTGCCTGAACAGAGGTCCAAGCCGAACCTGCCGCTGCCGGCCTGGATT 

CACAGGCACCCACTGTGAACTGCACATCAGCGATTGTGCCCGAAGTCCCTGTGCCCACGGGGGCACTTGC 

CACGATCTGGAGAATGGGCCTGTGTGCACCTGCCCCGCTGGCTTCTCTGGCAGGCGCTGCGAGGTGCGGA 

TAACCCACGATGCCTGTGCCTCCGGACCCTGCTTCAATGGGGCCACCTGCrACACTGGCCTCTCCCCAAA 

CAACTTCGTCTGCAACTGTCCTTATGGCTTTGTGGGCAGCCGCTGCGAGTTTCCCGTGGGCTTGCCACCC 

AGCTTCCCCTGGGTAGCTGTCTCGCTGGGCGTGGGGCTAGTGGTACTGCTGGTGCTGCTGGTCATGGTGG 

TAGTGGCTGTGCGGCAGCTGCGGCTTCGGAGGCCCGATGACGAGAGCAGGGAAGCCATGAACAATCTGC 

AGACTTCCAGAAGGACAACCTAATCCCTGCCGCCCAGCTCAAAAACACAAACCAGAAGAAGGAGCTGGA 

GTGGACTGTGGTCTGGACAAGTCCAATTGTGGCAAACTGCAGAACCACACATTGGACTACAATCTAGCCC 

CGGGACTCCTAGGACGGGGCAGCATGCCTGGGAAGTATCCTCACAGTGACAAGAGCTTAGGAGAGAAGT 

GCCACTTCGGTTACACAGTGAGAAGCCAGAGTGTCGAATATCAGCCATTTGCTCTCCCAGGGACTCTATG 

TACCAATCAGTGTGTTTGATATCAGAAGAGAGGAACGAGTGTGTGATTGCCACAGAGGTATAAGGCAGA 

GCCTACTCAGACACCCAGCTCCGGCCCAGCAGCTGGGCCTTCCITCTGCATTGT^ 

ATGGGACATCTTTAGTATGCACAGTGCTGCTCTGCGGAGGAGGAGGGAATGGCATGAACTGAACAGACG 

TGAACCCGCCAAGAGTTGCACCGGCTCTGCACACCTCCAGGAGTCTGCCTGGCTTCAGATGGGCAGCCCC 

GCCAAGGGAACAGAGTTGAGGAGTTAGAGGAGCATCAGTTGAGCTGATATCTAAGGTGCCTCTCGAACTT 

GGACTTGCTCTGCCAACAGTGGTCATCATGGAGCTCTTGACTGTTCTCCAGAGAGTGGCAGTGGCCCTAG 

TGGGTCITGGCGCTGCTGTAGCTCCTGTGGGCATCTGTATTTCCAAAGTGCCTTTGCCCAGACTCCATCC 

TCACAGCTGGGCCCAAATGAGAAAGCAGAGAGGAGGCTTGCAAAGGATAGGCCTCCCGCAGGCAGAACG 

CCTTGGAGTTTGGCATTAAGCAGGAGCTACTCTGCAGGTGAGGAAAGCCCGAGGAGGGGACACGTGTGC 

TCCTGCCTCCAACCCCAGCAGGTGGGGTGCCACCTGCAGCCTCTAGGCAAGAGTTGGTCCTTCCCCTGGT 

CCTGGTGCCTCTGGGCTCATGTGAACAGATGGGCITAGGGCACGCCCCTTTTGCCAGCCAGGGGTACAGG 

CCTCACTGGGGAGCTCAGGGCCTTCATGCTAAACTCCCAATAAGGGAGATGGGGGGAAGGGGGCTGTGC 

CTAGGCCCTTCCCTCCCTCACACCCATTTTTGGGCCCTTGAGCCTGGGCrCCACCAGTGCCCACT 

CCCGAGACCAACCITGAAGCCGATTITCAAAAATCAATAATATGAGGTTTTGTTTTGTAGTTTA 

AATCTAGTATTTTGATAATTTAAGAATCAGAAGCACTGGCCTTTCTACATTTTA 

ATAATGTGTATTTATAATATGAAACAGATGTGTACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



Figure 12 

AAACCCACTCCACCTTACTACCAGACAACCTTAGCCAAACCATTTACCCAAATAAAGTATAGGC 

GATAGAAATTGAAACCTGGCGCAATAGATATAGTACCGCAAGGGAAAGATGAAAAATTATAAC 

CAAGCATAATATAGCAAGGACTAACCCCTATACCTrCTGCATAATGAATTAACTAGAAATAACT 

TTGCAAGGAGAGTCAAAGCTAAGGCCCCCGAAACCAGGCGAGCTACCTAAGAACAGCTAAAA 

GAGCACACCCGTCTATGTAGCAAAATAGTGGGAAGATTTATAGGTAGAGGCGACAAACCTACC 

GAGCCTGGTGATAGCTGGTTGTCCAAGATAGAATCTrAGTTCAACTTTAAATTTGCCCACAGAA 

CCCTCTAAATCCCCTTGTAAATTTAACTGTTAGTCCAAAGAGGAACAGCTCTTTGGACACTAGG 

AAAAAACCTTGTAGAGAGAGTGTGAGCCCAATTCCACACTTTTCCACATGTTGGATGGCCTTGG 

AGTGGTAGCCATAAGCATTTTTGGAATTCAACTAAAAACTGAAGGATCCTTGAGGACGGCAGT 

ACCTGGCATACCTACACAGTCAGCGTTCAACAAGTGTTTGCAAAGGTACATTGGGGCACTGGG 

GGCACGAGTGATCTGTGACAATATCCCTGGTTTGGTGAGCCGGCAGCGGCAGCTGTGCCAGCGT 

TACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCCCGAGAATGGATCCGAGAGTGTCAGCAC 

CAATTCCGCCACCACCGCTGGAACTGTACCACCCTGGACCGGGACCACACCGTCTTTGGCCGTG 

TCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTATATGCCATCTCATCAGCAGGGGTGATCCA 

CGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAGTGTGTGCAGCTGTGACCCCTACACCCGT 

GGCCGACACCATGACCAGCGTGGGACTTTTGACTGGGGTGGCTGCAGTGACAACATCCACTAC 

GGTGTCCGTTTTGCCAAGGCCTTCGTGGATGCCAAGGAGAAGAGGCTTAAGGATGCCCGGGCC 

CTCATGAACTTACATAATAACCGCTGTGGTCGCACGGCTGTGCGGCGGTTTGTCAAGCTGGAGT 



WO 03/012082 



13/41 



PCT/GB02/03409 



GTAAGTGCCATGGCGTGAGTGGTTCCTGTACTCTGCGCACCTGCTGGCGTGCACTCTCAGATTT 

CCGCCGCACAGGTGATTACCTGCGGCGACGCTATGATGGGGCTGTGCAGGTGATGGCCACCCA 

AGATGGTGCCAACTTCACCGCAGCCCGCCAAGGCTATCGCCGTGCCACCCGGAGTGATCTTGTC 

TACTTTGACAACTCTCCAGATTACTGTGTCTTGGACAAGGCTGCAGGTTCCCTAGGCACTGCAG 

GCCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGGTTGTGAAATCATGTGCTGTGGCCGAG 

GGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGTGAGTGCAAATTCCACTGGTGCTGTGC 

TGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTCCATACTTGCAAAGCCCCCAAGAAGGC 

AGAGTGGCTGGACCAGACCTGAACACACAGATACCTCACTCATCCCTCCAATTCAAGCCTCTCA 

ACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCTCCACCCTCCACCCTGGGCTGCTACCGC 

TTCTATTTAAGGATGTAGAGAGTAATCCATAGGGACCATGGTGTCCTGGCTGGTTCCTTAGCCC 

TGGGAAGGAGTTGTCAGGGGATATAAGAAACTGTGCAAGCTCCCTGATTTCCCGCTCTGGAGAT 

TTGAAGGGAGAGTAGAAGAGATAGGGGGTCTTTAGAGTGAAATGAGTTGCACTAAAGTACGTA 

GTrGAGGCTCCTTTTTTCTTTCCTITGCACCAGCTTCCCGACACTTCT^ 

GGTACCTGTAGAGAGCTTCTTTTTGTITCTACCTGGCCAAAGTTAGATGGGACAAAGATGAATG 
GCATGTCCCTTCTCTGAAGTCCGTITGAGCAGAACT^^ 

GCTACCACATTCTATTATTGAGAGCCTGAGATGTTAGCCATAGTGGACAAGGTTCCATTCACAT 
GCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAAAGAATCATAACAATACAAACACACATT 
CATTCTCTCTTTTTCTCTCTACCATTCTC 

GCTGCCTGTrCAAACTGAGGTGGAATGCAGTGGTTCCCATGCTTAACAGATCATTAAAACACCC 
TAGAACACTCCTAGGATAGATTAATGT 

Figure 13 



ACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCACGGTTTCGTGGGGACCCAG 

GCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTTGAGTCCTTCTGAGATGATGGCTCT 

GGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAGCGGCGGCTCTCGGCGGCCACCC 

TCTGCTGGGAGTGAGCGCCACCrrGAACTCGGTTCTCAATTCCAACGCTATCAAGAACCTGCCC 

CCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTGCAGTCAGCGCCGCGCCGGGAATCCTG 

TACCCGGGCGGGAATAAGTACCAGACCATTGACAACTACCAGCCGTACCCGTGCGCAGAGGAC 

GAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACCCGCGGAGGGGACGCAGGCGTGCAA 

ATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATGCGTCACGCTATGTGCTGCCCCGGGA 

ATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAAATCATTTCCGAGGAGAAATTGAGGA 

AACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTTGGATGGGTATTCCAGAAGAACCACC 

TTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACT 

GTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGA 

AGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTG 

TTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCT 

AGGCITCACACTTGTCAGAGACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAA 

TAGATGCTATGAAAACCTTITATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTG 

GTTTCAGTTAAGCATTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTITGTITCTTTAT 

GGAACTCCCCTGTGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAA 

ATGCAATGAAACTTTrAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTrCCT 

AATTTTTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGT 

TTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCAA 

GGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTrATTTTC 

TGAAATGTACTATCTTAATGCTTAAATTATATTTC^ 

ATTTAACATTTAATATCATGAAATGTTATAA 



Figure 14 



AGAAAGCGGGAGCCCGCGGCGAGCGTAGCGCAAGTCCGCTCCCTAGGCATCGCTGCGCTGGCA 
GCGATTCGCTGTCTCTTGTGAGTCAGGGGACAACGCTTCGGGGCAACTGTGAGTGCGCGTGTGG 
GGGACCTCGATTCTCTTCAGATCTCGAGGA1TCGGTCCGGGGACGTCTCCTGATCCCCTACTAA 
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AGCGCCTGCTAACTTTGAAAAGGAGCACTGTGTCCTGCAAAGTTTGACACATAAAGGATAGGA 

AAAGAGAGGAGAGAAAAGCAACTGAGTTGAAGGAGAAGGAGCTGATGCGGGCCTCCTGATCA 

ATTAAGAGGAGAGTTAAACCGCCGAGATCCCGGCGGGACCAAGGAGGTGCGGGGCAAGAAGG 

AACGGAAGCGGTGCGATCCACAGGGCTGGGTTTTCTTGCACCTTGGGTCACGCCTCCTTGGCGA 

GAAAGCGCCTCGCATTTGATTGCTTCCAGTTATTGCAGAACTTCCTGTCCTGGTGGAGAAGCGG 

GTCTCGCTrGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCTGGGTC 

CCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGGGGCCC 

CGTGTrGGGGTCCTCCCtCCCTTrGCATTCCCACCCCTCCGGGCTTTGCGTCTTCCTGGGGACCC 

CCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGGATTCGTCCTGCTGCCTGCTCCTACTGG 

CCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGTTCGCGGGCCAAACTCAACTCCATCA 

AGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCGGGCATGTACCAAG 

GACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGCAGTG 

ATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGCATGG 

TGTGTCGGAGAAAAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGCTGCA 

ATAATGGCATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGGATGG 

TACTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGAATCT 

AGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGCCTACGATC 

ATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATTTCTGGACCAAAATCTGCAAACCAGTG 

CTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCTCATGGGCTGGAAATTTTC 

CAGCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCCTCCA 

AAGCCAGACTCCATGTGTGTCAGAAAATTTGATCACCATTGAGGAACATCATCAATTGCAGACT 

GTGAAGTTGTGTATTTAATGCATTATAGCATGGTGGAAAATAAGGTTCAGATGCAGAAGAATG 

GCTAAAATAAGAAACGTGATAAGAATATAGATGATCACAAAAAGGGAGAAAGAAAACATGAA 

CTGAATAGATTAGAATGGGTGACAAATGCAGTGCAGCCAGTGTTTCCATTATGCAACTTGTCTA 

TGTAAATAATGTACACATTTGTGGAAAATGCTATTATTAAGAGAACAAGCACACAGTGGAAAT 

TACTGATGAGTAGCATGTGACTITCCAAGAGTTTAGGTTGTGCTGGAGGAGAGGTTTCCTTCAG 

ATTGCTGATTGCTTATACAAATAACCTACATGCCAGATTTCTATTCAACGTTAGAGTTTAACAA 

AATACTCCTAGAATAACTTGTTATACAATAGGTTCTAAAAATAAAATTGCTAAACAAGAAATGA 

AAACATGGAGCATTGTTAATTTACAACAGAAAATTACCTTTTGATTTGTAACACTACTTCT 

TTCAATCAAGAGTCTTGGTAGATAAGAAAAAAATCAGTCAATATTTCCAAATAATTGCAAAATA 

ATGGCCAGTTGTTTAGGAAGGCCTITAGGAAGACAAATAAATAACAAACAAACAGCCACAAAT 

ACTTTTTTTTCAAAATTTTAGTTTTACCTC 

CCTTCAGATTCTACGGAATGACAGTATATCTCTCTTTATCCTATGTGATTCCTGCTCTGAATGCA 

TTATATTITCCAAACTATACCCATAAATTGTGACTAGTAAAATACTTACACAGAGCAGAATTTT 

CACAGATGGCAAAAAAATTTAAAGATGTCCAATATATGTGGGAAAAGAGCTAACAGAGAGATC 

ATTAlTrCTTAAAGATTGGCCATAACCTGTATTTrGATAGAATTAGATTGGTAAATACATGTATT 

CATACATACTCTGTGGTAATAGAGACTrGAGCTGGATCTGTACTGCACTGGAGTAAGCAAGAA 

AATTGGGAAAACTTTTTCGTTTGTTCAGGTITrGGCAACACATAGATCATATGTCTGAGGCACA 

AGTTGGCTGTTCATCTTTGAAACCAGGGGATGCACAGTCTAAATGAATATCTGCATGGGATTTG 

CTATCATAATATTTACTATGCAGATGAATTCAGTGTGAGGTCCTGTGTCCGTACTATCCTCAAAT 

TATTTATTTTATAGTGCTGAGATCCTCAAATAATCTCAATTrCAGGAGGTTTCACAAAATGGACT 

CCTGAAGTAGACAGAGTAGTGAGGTITCATTGCCCTCTATAAGCTTCTGACTAGCCAATGGCAT 

CATCCAATirrCTTCCCAAACCTCTGCAGCATCTGCTTTATrGCCAAAGGGCTAGTTTCGG 

CTGCAGCCATTGCGGTTAAAAAATATAAGTAGGATAACTTGTAAAACCTGCATATTGCTAATCT 

ATAGACACCACAGTTTCTAAATTCTITGAAACCACTTTACTACl 1 1 1 1 1 l AAACTTAACTCAGTT 

CTAAATACITrGTCTGGAGCACAAAACAATAAAAGGTTATCTTATAGTCGTGACTITAAACTTT 

TGTAGACCACAATTCACTTTTTAGTTTTCTTTrACTTAAATCCCATCT 

TCTCCCAGTAGAGATTGAGTTTGAGCCTGTATATCTATTAAAAATTTCAACTTCCCACATATATT 

TACTAAGATGATTAAGACTTACATTTTCTGCACAGGTCTGCAAAAACAAAAATTATAAACTAGT 

CCATCCAAGAACCAAAGTTTGTATAAACAGGTTGCTATAAGCTTGGTGAAATGAAAATGGAAC 

ATTTCAATCAAACATTTCGTATATAACAATTATTATATT^ 

TTATGTCCACCCTTTTAAAAATTATTATTTGAAGTAATTTATTTACAGGAAATG 

TATTTTCTTATAGAGATATTTCTTACAGAAAGCTTTGTAGCAGAATATATTTGCAGCTATT 

TTGTAATTTAGGAAAAATGTATAATAAGATAAAATCTATTAAATTITTCTCCTCT 

ATTCAAAGC 
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ACACACAGGCGGCGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTG 

CCTGCTGCTGGCGGCGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCA 

GTCAAGCCCGGCCCGGCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGC 

GAGGTTGAGGAACTGATGGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGA 

GGCAGAAGAAGCTGCTGCTAAAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTA 

TCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAAT 

TCACAAGATAACCAACAACCAGACTGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTG 

GGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCAT 

GTACTGCCAGTTTGCCAGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGC 

ACCCGGGACAGTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATGGCC 

ACCAGGGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGCTGT 

GCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTTTGCC 

ATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGAGCCTTGGA 

CCGATGCCCTTGTGCCAGTGGCCTCCJ'CTGCCAGCCCCACAGCCACAGCCTGGTGTATGTGTGC 

AAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTCCCC 

GATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGAGAGG 

AGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTGCCGCCGCTGCACTGCTGGGAGGG 

GAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAATAGAAATAGCTAATTTATTTCC 

CCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTACATCTTCTrCCCAGTAAGTTTC 

CCCTCTGGCrTGACAGCATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCAGGCT 

TCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACCCCTGT 

CCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCTTTGAT 

AATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTTGGGGAAATG 

TGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATGCAACAAATGAATTTTCCA 

CGCAGTTCITrCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCAGATGAAATGTTCTGTTC 

ACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCCTACCTCTGTGCCAGGGCA 

GCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTCATTGTTCTCC 

TCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCCCAAGTCACACAGCTAGT 

GAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCTCTCCACTACCCCACAC 

CAGCCTTGGTOKXACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 

TGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAACTACTGTTAGGA 

ACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGATATTGACACTGT 

CCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATACAGGTTAA 

CCTGCAGAAACAGTACTTAGGTAA1TGTAGGGCGAGGATTATAAATGAAATTTGCAAAATCAC 

TTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGCTGTGTG 

AAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATGTTTTC 

AGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCITAAAGTTTAAAGTTGCACATG 

ATTGTATAAGCATGCITrCTTTGAGTTTTAAATTATGTATAAACATAAGT^^ 

AGCATAAATCACTTCAACTGCTCTTCT 



Figure 16 



GACAAACAGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGAGCAGC 

CrCGCrTTGGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAGGATGGTGGCG 

GCCGTCCTGCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGGTCCTGGACTTCAACA 

ACATCAGGAGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTCACAGTGCCTGTCTGACACGG 

ACTGCAATACCAGAAAGTTCTGCCTCCAGCCCCGCGATGAGAAGCCGTTCTGTGCTACATGTCG 

TGGGTTGCGGAGGAGGTGCCAGCGAGATGCCATGTGCTGCCCTGGGACACTCTGTGTGAACGA 

TGTTTGTACTACGATGGAAGATGCAACCCCAATATTAGAAAGGCAGCTTGATGAGCAAGATGG 

CACACATGCAGAAGGAACAACTGGGCACCCAGTCCAGGAAAACCAACCCAAAAGGAAGCCAA 

GTATTAAGAAATCACAAGGCAGGAAGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACT 

GTGGCCCTGGACTTTGCTGTGCTCGTCATTTTTGGACGAAAATITGTAAGCCAGTCCTm 

GGACAGGTCTGCTCCAGAAGAGGGCATAAAGACACTGCTCAAGCTCCAGAAATCTTCCAGCGT 
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TGCGACTGTGGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGCTCGAT 
TAAGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACATTGC 

ATTTGAG 



Figure 17 

ATGGGGCTCTGGGCGCTGTTGCCTGGCTGGGTTTCTGCTACGCTGCTGCTGGCGCTGGCCGCTCT 

GCCCGCAGCCCTGGCTGCCAACAGCAGTGGCCGATGGTGGGGTATTGTGAACGTAGCCTCCTCC 

ACGAACCTGCTTACAGACTCCAAGAGTCTGCAACTGGTACTCGAGCCCAGTCTGCAGCTG1TGA 

GCCGCAAACAGCGGCGCCTGATACGCCAAAATCCGGGGATCCTGCACAGCGTGAGTGGGGGGC 

TGCAGAGTGCCGTGCGCGAGTGCAAGTGGCAGTTCCGGAATCGCCGCTGGAACTGTCCCACTG 

CTCCAGGGCCCCACCTCTTCGGCAAGATCGTCAACCGAGGCTGTCGAGAAACGGCGTTTATCTT 

CGCTATCACCTCCGCCGGGGTCACCCATTCGGTGGCGCGCTCCTGCTCAGAAGGTTCCATCGAA 

TCCTGCACGTGTGACTACCGGCGGCGCGGCCCCGGGGGCCCCGACTGGCACTGGGGGGGCTGC 

AGCGACAACATTGACTTCGGCCGCCTCTTCGGCCGGGAGTTCGTGGACTCCGGGGAGAAGGGG 

CGGGACCTGCGCTTCCTCATGAACCTTCACAACAACGAGGCAGGCCGTACGACCGTATTCTCCG 

AGATGCGCCAGGAGTGCAAGTGCCACGGGATGTCCGGCTCATGCACGGTGCGCACGTGCTGGA 

TGCGGCTGCCCACGCTGCGCGCCGTGGGCGATGTGCTGCGCGACCGCTTCGACGGCGCCTCGCG 

CGTCCTGTACGGCAACCGCGGCAGCAACCGCGCTTCGCGAGCGGAGCTGCTGCGCCTGGAGCC 

GGAAGACCCGGCCCACAAACCGCCCTCCCCCCACGACCTCGTCTACTTCGAGAAATCGCCCAAC 

TTCTGCACGTACAGCGGACGCCTGGGCACAGCAGGCACGGCAGGGCGCGCCTGTAACAGCTCG 

TCGCCCGCGCTGGACGGCTGCGAGCTGCTCTGCTGCGGCAGGGGCCACCGCACGCGCACGCAG 

CGCGTCACCGAGCGCTGCAACTGCACCTTCCACTGGTGCTGCCACGTCAGCTGCCGCAACTGCA 

CGCACACGCGCGTACTGCACGAGTGTCTGTGA 



Figure 18 

AGCAGAGCGGACGGGCGCGCGGGAGGCGCGCAGAGCTTTCGGGCTGCAGGCGCTCGCTGCCGC 

TGGGGAATTGGGCTGTGGGCGAGGCGGTCCGGGCTGGCCTTTATCGCTCGCTGGGCCCATCGTT 

TGAAACTTTATCAGCGAGTCGCCACTCGTCGCAGGACCGAGCGGGGGGCGGGGGCGCGGCGAG 

GCGGCGGCCGTGACGAGGCGCTCCCGGAGCTGAGCGCTTCTGCTCTGGGCACGCATGGCGCCC 

GCACACGGAGTCTGACCTGATGCAGACGCAAGGGGGTTAATATGAACGCCCCTCTCGGTGGAA 

TCTGGCTCTGGCTCCCTCTGCTCTTGACCTGGCTCACCCCCGAGGTCAACTCTTCATGGTGGTAC 

ATGAGAGCTACAGGTGGCTCCTCCAGGGTGATGTGCGATAATGTGCCAGGCCTGGTGAGCAGC 

CAGCGGCAGCTGTGTCACCGACATCCAGATGTGATGCGTGCCATTAGCCAGGGCGTGGCCGAG 

TGGACAGCAGAATGCCAGCACCAGTTCCGCCAGCACCGCTGGAATTGCAACACCCTGGACAGG 

GATCACAGCCTTTTTGGCAGGGTCCTACTCCGAAGTAGTCGGGAATCTGCCT1TGTTTATGCCAT 

CTCCTCAGCTGGAGTTGTATTTGCCATCACCAGGGCCTGTAGCCAAGGAGAAGTAAAATCCTGT 

TCCTGTGATCCAAAGAAGATGGGAAGCGCCAAGGACAGCAAAGGCATTTTTGATTGGGGTGGC 

TGCAGTGATAACATTGACTATGGGATCAAATTTGCCCGCGCATTTGTGGATGCAAAGGAAAGG 

AAAGGAAAGGATGCCAGAGCCCTGATGAATCTTCACAACAACAGAGCTGGCAGGAAGGCTGTA 

AAGCGGTTCTTGAAACAAGAGTGCAAGTGCCACGGGGTGAGCGGCTCATGTACTCTCAGGACA 

TGCTGGCTGGCCATGGCCGACTTCAGGAAAACGGGCGATTATCTCTGGAGGAAGTACAATGGG 

GCCATCCAGGTGGTCATGAACCAGGATGGCACAGGTTTCACTGTGGCTAACGAGAGGTTTAAG 

AAGCCAACGAAAAATGACCTCGTGTATrrTGAGAATTCTCCAGACTACTGTATCAGGGACCGAG 

AGGCAGGCTCCCTGGGTACAGCAGGCCGTGTGTGCAACCTGACTTCCCGGGGCATGGACAGCT 

GTGAAGTCATGTGCTGTGGGAGAGGCTACGACACCTCCCATGTCACCCGGATGACCAAGTGTG 

GGTGTAAGTTCCACTGGTGCTGCGCCGTGCGCTGTCAGGACTGCCTGGAAGCTCTGGATGTGCA 

CACATGCAAGGCCCCCAAGAACGCTGACTGGACAACCGCTACATGACCCCAGCAGGCGTCACC 

ATCCACCTTCCCITCTACAAGGACTCCATTGGATCTGCAAGAACACTGGACCTTTGGGTTCTTTC 

TGGGGGGATATTTCCTAAGGCATGTGGCCTTTATCTCAACGGAAGCCCCCTCTTCCTCCCTGGG 

GGCCCCAGGATGGGGGGCCACACGCTGCACCTAAAGCCTACCCTATTCTATCCATCTCCTGGTG 

TTCTGCAGTCATCTCCCCTCCTGGCGAGTTCTCTTTGGAAATAGCATGACAGGCTGTTCAGCCGG 

GAGGGTGGTGGGCCCAGACCACTGTCTCCACCCACCTTGACGTTTCTTCTTTCTAGAGCAGTTG 
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GCCAAGCAGAAAAAAAAGTGTCTCAAAGGAGCTTTCTCAATGTCTTCCCACAAATGGTCCC^ 

TAAGAAATTCCATACITCrCrCAGATGGAACAGTAAAGAAAGCAGAATCAACTGCCCCTGACTT 

AACTTTAACrmGAAAAGACCAAGACTTTTGTCTGTACAAGTGGTITrACAGCTACCACCCTTA 

GGGTAATTGGTAATTACCTGGAGAAGAATGGCTTTCAATACCCTTTTAAGTTTAAAATGTGTAT 

TTTTCAAGGCATTTATTGCCATATTAAAATCTGATGTAACAAGGTGGGGACGTGTGTCCTTTGGT 

ACTATGGTGTGTTGTATCTITGTAAGAGCAAAAGCCTCAGAAAGGGATTGCTTTGCATTACTGT 

CCCCTTGATATAAAAAATCTTTAGGGAATGAGAGTTCCTTCTCACTTAGAATCTGAAGGGAA'TT 

AAAAAGAAGATGAATGGTCTGGCAATATTCTGTAACrATTC 

TTAGTGGATGGAATATCAGAAGTATATCTGTACAGATCAAGAAAAAAAGGAAGAATAAAATTC 
CTATATCAT 



Figure 19 

CGGGAGTCTTCGGGGAGCTATGCTGAGACCGGGTGGTGCGGAGGAAGCTGCGCAGCTCCCGCT 

TCGGCGCGCCAGCGCCCCGGTCCCTGTGCCGTCGCCCGCGGCCCCCGACGGCTCCCGGGCTTCG 

GCCCGCCTAGGTCTTGCCTGCCTTCTGCTCCTGCTGCTGCTGACGCTGCCGGCCCGCGTAGACAC 

GTCCTGGTGGTACATTGGGGCACTGGGGGCACGAGTGATCTGTGACAATATCCCTGGTTTGGTG 

AGCCGGCAGCGGCAGCTGTGCCAGCGTTACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCC 

CGAGAATGGATCCGAGAGTGTCAGCACCAATTCCGCCACCACCGCTGGAACTGTACCACCCTG 

GACCGGGACCACACCGTCTTTGGCCGTGTCATGCTCAGAAGTAGCCGAGAGGCAGCTnTGTAT 

ATGCCATCTCATCAGCAGGGGTAGTCCACGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAG 

TGTGTGCAGCTGTGACCCCTACACCCGTGGCCGACACCATGACCAGCGTGGGGACTTTGACTGG 

GGTGGCTGCAGTGACAACATCCACTACGGTGTCCGTTTTGCCAAGGCCTTCGTGGATGCCAAGG 

AGAAGAGGCTTAAGGATGCCCGGGCCCTCATGAACTTACATAATAACCGCTGTGGTCGCACGG 

CTGTGCGGCGGTTTCTGAAGCTGGAGTGTAAGTGCCATGGCGTGAGTGGTTCCTGTACTCTGCG 

CACCTGCTGGCGTGCACTCTCAGATTTCCGCCGCACAGGTGATTACCTGCGGCGACGCTATGAT 

GGGGCTGTGCAGGTGATGGCCACCCAAGATGGTGCCAACTTCACCGCAGCCCGCCAAGGCTAT 

CGCCGTGCCACCCGGACTGATCTTGTCTACTTTGACAACTCrCCAGATTACTGTGTCTTGGACAA 

GGCTGCAGGTTCCCTAGGCACTGCAGGCCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGG 

TTGTGAAATCATGTGCTGTGGCCGAGGGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGT 

GAGTGCAAATTCCACTGGTGCTGTGCTGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTC 

CATACTrGCAAAGCCCCCAAGAAGGCAGAGTGGCTGGACCAGACCTGAACACACAGATACCTC 

ACTCATCCCTCCAATTCAAGCCTCTCAACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCT 

CCACCCTCCACCCTGGGCTGCTACCGCTTCTATTTAAGGATGTAGAGAGTAATCCATAGGGACC 

ATGGTGTCCTGGCTGGTTCCTTAGCCCTGGGAAGGAGTTGTCAGGGGATATAAGAAACTGTGCA 

AGCTCCCTGATTTCCCGCTCTGGAGATTTGAAGGGAGAGTAGAAGAGATAGGGGGTCTTTAGA 

GTGAAATGAGTTGCACrAAAGTACGTAGTTGAGGCTCCTTTTTTCTTTCCTTTGCACCAGCT^ 

CGACACTTCTTGGTGTGCAAGAGGAAGGGTACCTGTAGAGAGCTrCTITITGTTTCTACCTGGC 

CAAAGTTAGATGGGACAAAGATGAATGGCATGTCCCTTCTCTGAAGTCCGTTTGAGCAGAACTA 

CCTGGTACCCCGAAAGAAAAATCTTAGGCTACCACA'rTCTATTATTGAGAGCCTGAGATGTTAG 

CCATAGTGGACAAGGTTCCATTCACATGCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAA 

AGAATCATAACAATACAAACACACATTCATTCTCTCTTTTTCTCTCTACCATTCTCAACCTGTAT 

TGGACAGCACTGCCTCTTTrGCTTACTTGCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCC 

CATGCTTAACAGATCATTAAAACACCCTAGAACACTCCTAGGATAGATTAATGT 



Figure 20 

GCGCTTCTGACAAGCCCGAAAGTCATTTCCAATCTCAAGTGGACTTTGTTCCAACTATTGGGGG 

CGTCGCTCCCCCTCYTCATGGTCGCGGGCAAACTTCCTCCTCGGCGCCTCTTCTAATGGAGCCCC 

ACCTGCTCGGGCTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCCTCGCTGGCTACCCAAT 

TTGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGCTCACAGCCCCTGCTCTGCGGC 

TCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGATCATGCCCG 
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CGTGGCCGAGGGCGTGAAGCTGGGCATCCAGGAGTGCCAGCACCAGTTCCGGGGCCGCCGCT 

GGAACTGCACCACCATAGATGACAGCCTGGCCATCTTTGGGCCCGTCCTCGACAAAGCCACCCG 

CGAGTCGGCCTTCGTTCACGCCATCGCCTCGGCCGGCGTGGCCTTCGC^ 

GCCGAGGGCACCTCCACCATTTGCGGCTGTGACTCGCATCATAAG^ 

TGGAAGTGGGGCGGCTGCAGCGAGGACGCTGACirCGGCGTGTTAGTGTCCAGGGAGTrCGGG 
GATGCGCGCGAGAACAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAACAACGAGGCGGG 
CCGCACGACTATCCT^ACCACATGCACCTCAAATGCAAGTGCCACGGGCT 
GAGGTGAAGACCT^GTGGGCGCAGCCTGACTTC 

AGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGAGTCCCGAGGCTGGGTGGAGA 

CCCTCCGGGCCAAGTACTCGCTCTrCAAGCCACCCACGGAGAGGGACCTGGTCTACTACGAGA 

ACTCCCCCAACTTITGTGAGCCCAACCCAGAGACGGGTTCCTTTGGCACAAGGGACCGGACTTG 

CAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTCrGCTGTGGCCGGGGCCACAACACG 

AGGACGGAGAAGCGGAAGGAAAAATGCCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGC 

CAGGAGTGTATTCGCATCTACGACGTGCACACCTGCAAGTAGGGCACCAG 



Figure 21 

ATGAGTCCCCGCTCGTGCCTGCGTTCGCTGCGCCrCCTCGTCTTCGCCGTCTTCTCAGCCGCCGC 

GAGCAACTGGCTGTACCTGGCCAAGCTGTCGTCGGTGGGGAGCATCTCAGAGGAGGAGACGTG 

CGAGAAACTCAAGGGCCTGATCCAGAGGCAGGTGCAGATGTGCAAGCGGAACCTGGAAGTCAT 

GGACTCGGTGCGCCGCGGTGCCCAGCTGGCCATTGAGGAGTGCCAGTACCAGTTCCGGAACCG 

GCGCTGGAACTGCTCCACACTCGACTCCTTGCCCGTCTTCGGCAAGGTGGTGACGCAAGGGATT 

CGGGAGGCGGCCTTGGTGTACGCCATCTCTTCGGCAGGTGTGGCCTTTGCAGTGACGCGGGCGT 

GCAGCAGTGGGGAGCTGGAGAAGTGCGGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAG 

GGCTTCCAGTGGTCAGGATGCTCTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTG 

TGGATGTGCGGGAGAGAAGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACA 

ATGAGGCCGGCAGGAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGT 

CAGGCTCCTGTGAGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACG 

CACTGAAGGAGAAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCAGGG 

CACTGGTGCCACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACTTGGTGTACTTGGAGC 

CTAGCCCCGACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACGAGGGGCCGCACAT 

GCAACAAGACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCTGTGGCCGCGGCTTCCACA 

CGGCGCAGGTGGAGCTGGCTGAACGCTGCAGCTGCAAATTCCACTGGTGCTGCTTCGTCAAGTG 

CCGGCAGTGCCAGCGGCTCGTGGAGTTGCACACGTGCCGATGA 



Figure 22 

ATTAATTCTGGCTCCACTTGTTGCTCGGCCCAGGTTGGGGAGAGGACGGAGGGTGGCCGCAGC 

GGGTTCCTGAGTGAATTACCCAGGAGGGACTGAGCACAGCACCAACTAGAGAGGGGTCAGGGG 

GTGCGGGACTCGAGCGAGCAGGAAGGAGGCAGCGCCTGGCACCAGGGCTTTGACTCAACAGA 

ATTGAGACACGTTTGTAATCGCTGGCGTGCCCCGCGCACAGGATCCCAGCGAAAATCAGATTTC 

CTGGTGAGGTTGCGTGGGTGGATrAATTTGGAAAAAGAAACTGCCTATATCTTGCCATCAAAAA 

ACTCACGGAGGAGAAGCGCAGTCAATCAACAGTAAACTTAAGAGACCCCCGATGCTCCCCTGG 

TTTAACTTGTATGCTTGAAAATTATCTGAGAGGGAATAAACATCTTITCC^ 

AAGTCCATTGGAATATTAAGCCCAGGAGTTGCTTTGGGGATGGCTGGAAGTGCAATGTCTTCCA 

AGTTCTrCCTAGTGGCITrGGCCATATTTTTCTCCITCGCCCAGGTTGTAATTGAAGCC 

GGTGGTCGCTAGGTATGAATAACCCTGTTCAGATGTCAGAAGTATATATTATAGGAGCACAGCC 

TCrCTGCAGCCAACTGGCAGGACTTTCTCAAGGACAGAAGAAACTGTGCCACTTGTATCAGGAC 

CACATGCAGTACATCGGAGAAGGCGCGAAGACAGGCATCAAAGAATGCCAGTATCAATTCCGA 

CATCGACGGTGGAACTGCAGCACTGTGGATAACACCTCTGTTTTTGGCAGGGTGATGCAGATAG 

GCAGCCGCGAGACGGCCTTCACATACGCCGTGAGCGCAGCAGGGGTGGTGAACGCCATGAGCC 

GGGCGTGCCGCGAGGGCGAGCTGTCCACCTGCGGCTGCAGCCGCGCCGCGCGCCCCAAGGACC 
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TGCCGCGGGACTGGCTCTGGGGCGGCTGCGGCGACAACATCGACTATGGCTACCGCTTTGCCAA 

GGAGTTCGTGGACGCCCGCGAGCGGGAGCGCATCCACGCCAAGGGCTCCTACGAGAGTGCTCG 

CATCCTCATGAACCTGCACAACAACGAGGCCGGCCGCAGGACGGTGTACAACCTGGCTGATGT 

GGCCTGCAAGTGCCATGGGGTGTCCGGCTCATGTAGCCTGAAGACATGCTGGCTGCAGCTGGC 

AGACTTCCGCAAGGTGGGTGATGCCCTGAAGGAGAAGTACGACAGCGCGGCGGCCATGCGGCT 

CAACAGCCGGGGCAAGTTGGTACAGGTCAACAGCCGCTTCAACTCGCCCACCACACAAGACCT 

GGTCTACATCGACCCCAGCCCTGACTACTGCGTGCGCAATGAGAGCACCGGCTCGCTGGGCAC 

GCAGGGCCGCCTGTGCAACAAGACGTCGGAGGGCATGGATGGCTGCGAGCTCATGTGCTGCGG 

CCGTGGGTACGACCAGTTCAAGACCGTGCAGACGGAGCGCTGCCACTGCAAGTTCCACTGGTG 

CTGCTACGTCAAGTGCAAGAAGTGCACGGAGATCGTGGACCAGTTTGTGTGCAAGTAGTGGGT 

GCCACCCAGCACTCAGCCCCGCTCCCAGGACCCGCTTATTTATAGAAAGTACAGTGATTCTGGT 

TTTTGGTTTITAGAAATATTTTTTATT^ 

TTACCATCTAAGAACTCTGTGGTTTATTATTAATATTATAATTATTATTTGGCAATAATGGGGGT 
GGGAACCACGAAAAATATTTATTTTGTGGATCTTTGAAAAGG 

AGTATAGAATGAAGGGGGAAATAACACATACCCTAACTTAGCTGTGTGGGACATGGTACACAT 

CCAGAAGGTAAAGAAATACATTTTCTTTTTCTCAAATATGCCATCATATGGGATGGGTAGGTTC 

CAGTTGAAAGAGGGTGGTAGAAATCTATTCACAATTCAGCTTCTATGACCAAAATGAGTTGTAA 

ATTCTCTGGTGCAAGATAAAAGGTCTTGGGAAAACAAAACAAAACAAAACAAACCTCCCTTCC 

CCAGCAGGGCTGCTAGCTTGCTrrCTGCATTTTCAAAATGATAATTTACAATGGAAGGACAAGA 

ATGTCATATTCTCAAGGAAAAAAGGTATATCACATGTCTCATTCTCCTCAAATATTCCATTTGCA 

GACAGACCGTCATATTCTAATAGCTCATGAAATTTGGGCAGCAGGGAGGAAAGTCCCCAGAAA 

TrAAAAAATTTAAAACTCTTATGTCAAGATGTTGATTTGAAGCTGTTATAAGAATTGGGATTCC 

AGATTTGTAAAAAGACCCCCAATGATTCTGGACACTAGATTTTITGTTrGGGGAGGTTGG 

AACATAAATGAAATATCCTGTATmCTTAGGGATACTTGGTTAGTAAATrATAATAGTAGAAA 

TAATACATGAATCCCATTCACAGGTTTCTCAGCCCAAGCAACAAGGTAATTGCGTGCCATTCAG 

CACTGCACCAGAGCAGACAACCTATTTGAGGAAAAACAGTGAAATCCACCTTCCTCTTCACACT 

GAGCCCTCTCTGATTCCTCCGTGTTGTGATGTGATGCTGGCCACGTTrCCAAACGGCAGCTCCAC 

TGGGTCCCCTTTGGTTGTAGGACAGGAAATGAAACATTAGGAGCTCTGCTTGGAAAACAGTTCA 

CTACTTAGGGATTTTTGTTTCCTAAAACTTTTATm 

ACAGAACTTGGCTAATGGAATTCACAGAGGTGTTGCAGCGTATCACTGTTATGATCCTGTGTTT 

AGATTATCCACTCATGCTTCTCCTATrGTACTGCAGGTGTACCTTAAAACTGTTCCCAGT GTACT 

TGAACAGTTGCATTTATAAGGGGGGAAATGTGGTTTAATGGTGCCTGATATCTCAAAGTCTTTT 

GTACATAACATATATATATATATACATATATATAAATATAAATATAAATATATCTCATTGCAGC 

CAGTGATTTAGATTTACAGCTTACTCTGGGGTTATCTCTCTGTCTAGAGCATTGTTGTCCTTCAC 

TGCAGTCCAGTTGGGATTATTCCAAAAGTTTrTTGAGTCTTGAGCTTGGGCTGTGGCCCCGCTGT 

GATCATACCCTGAGCACGACGAAGCAACCTCGTTTCTGAGGAAGAAGCTTGAGTTCTGACTCAC 

TGAAATGCGTGTTGGGTTGAAGATATCnTri'l TCI 1 1 1CTGCCTCACCCCTTTGTCTCCAACCTC 

CATTrCrGTTCACTTTGTGGAGAGGGCATTACTTGTTCGTTATAGACATGGACGTTAAGAGATAT 

TCAAAACTCAGAAGCATCAGCAATGTTTCTCTTlTCrrAGTTCATTCTGCAGAATGGAAACCCAT 

GCCTATTAGAAATGACAGTACTTATTAATTGAGTCCCTAAGGAATATTCAGCCCACTACATAGA 

TAGCTT l lllllllUlllllllirriTAATAAGGACACCTCTTTCCAAACAGGCCATCAAATATGT 

TCTTATCTCAGACTTACGTTGTTTTAAAAGTTTGGAAAGATACACATCTITTC^^ 

AGGAGGTTGGGCTTTCATATCACCTCAGCCAACTGTGGCTCTTAATTTATTGCATAATGATATCC 

ACATCAGCCAACTGTGGCTCTTTAATTTATTGCATAATGATATTCACATCCCCTCAGTTGCAGTG 

AATTGTGAGCAAAAGATCTTGAAAGCAAAAAGCACTAATTAGTTTAAAATGTCACnTTT^ 

TTTTATTATACAAAAACCATGAAGTACTTTTTTTATTTGCT 

CTCATGTTTATGAAGAGAGTTGAGTTTAACAATCCTAGCTTTTAAAAGAAA 

AATATTCTACATGTCATTCAGATATTATGTATATC 

ATATTTCTGTCTTGCGTGATTTGTATAT^ 

CCAAATGGAAG 

Figure 23 

GGCACGAGCGCAGGAGACACAGGCGCTGGCTGCCCCGTCCGCTCTCCGCCTCCGCCGCGCCCTCCTCGCC 
CGGG ATGGGCCCCCCCGCCGCCGCCGGATCCCTCGCCTCCCGGCCGCCGCCGTTGCGCTCGCCGCGCTCG 
CACTGAAGCCCGGGCCCTCGCGCGCCGCGGTTCGCCCCGCAGCCTCGCCCCCTGCCCACCCGGGCGGCCG 
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TAGGGCGGTCACG ATGCTGCCGCCCTTACCCTCCCGCCTCGGGCTGCTGCTGCTGCTGCTCCTGTGCCCG 

GCGCACGTCGGCGGACTGTGGTGGGCTGTGGGCAGCCCCTTGGTTATGGACCCTACCAGCATCTGCAGGA 

AGGCACGGCGGCTGGCCGGGCGGCAGGCCGAGTTGTGCCAGGCTGAGCCGGAAGTGGTGGCAGAGCTAGC 

TCGGGGCGCCCGGCTCGGGGTGCGAGAGTGCCAGTTCCAGTTCCGCTTCCGCCGCTGGAATTGCTCCAGC 

CACAGCAAGGCCTTTGGACGCATCCTGCAACAGGACATTCGGGAGACGGCCTTCGTGTTCGCCATCACTG 

CGGCCGGCGCCAGCCACGCCGTCACGCAGGCCTGTTCTATGGGCGAGCTGCTGCAGTGCGGCTGCCAGGC 

GCCCCGCGGGCGGGCCCCTCCCCGGCCCTCCGGCCTGCCCGGCACCCCCGGACCCCCTGGCCCCGCGGGC 

TCCCCGGAAGGCAGCGCCGCCTGGGAGTGGGGAGGCTGCGGCGACGACGTGGACTTCGGGGACGAGAAGT 

CGAGGCTCTTTATGGACGCGCGGCACAAGCGGGGACGCGGAGACATCCGCGCGTTGGTGCAACTGCACAA 

CAACGAGGCGGGCAGGCTGGCCGTGCGGAGCCACACGCGCACCGAGTGCAAATGCCACGGGCTGTCGGGA 

TCATGCGCGCTGCGCACCTGCTGGCAGAAGCTGCCTCCATTTCGCGAGGTGGGCGCGCGGCTGCTGGAGC 

GCTTCCACGGCGCCTCACGCGTCATGGGCACCAACGACGGCAAGGCCCTGCTGCCCGCCGTCCGCACGCT 

CAAGCCGCCGGGCCGAGCGGACCTCCTCTACGCCGCCGATTCGCCCGACTTTTGCGCCCCCAACCGACGC 

ACCGGCTCCCCCGGCACGCGCGGTCGCGCCTGCAATAGCAGCGCCCCGGACCTCAGCGGCTGCGACCTGC 

TGTGCTGCGGCCGCGGGCACCGCCAGGAGAGCGTGCAGCTCGAAGAGAACTGCCTGTGCCGCTTCCACTG 

GTGCTGCGTAGTACAGTGCCACCGTTGCCGTGTGCGCAAGGAGCTCAGCCTCTGCCTGTGACCCGCCGCC 

CGGCCGCTAGACTGACTTCGCGCAGCGGTGGCTCGCACCTGTGGGACCTCAGGGCACCGGCACCGGGCGC 

CTCTCGCCGCTCGAGCCCAGCCTCTCCCTGCCAAAGCCCAACTCCCAGGGCTCTGGAAATGGTGAGGCGA 

GGGGCTTGAGAGGAACGCCCACCCACGAAGGCCCAGGGCGCCAGACGGCCCCGAAAAGGCGCTCGGGGAG 

CGTTTAAAGGACACTGTACAGGCCCTCCCTCCCCTTGGCCTCTAGGAGGAAACAGTTTTTTAGACTGGAA 

AAAAGCCAGTCTAAAGGCCTCTGGATACTGGGCTCCCCAGAACTGCTGGCCACAGGATGGTGGGTGAGGT 

TAGTATCAATAAAGATATTTAAACCAAAAAAAAAAAAAAAAAAAAA 



Figure 24 

CACGCGTCCGGGCCAATCGGGACTATGAACCGGAAAGCGCTGCGCTGCCTGGGCCACCTCnTC 

TCAGCCTGGGCATGGTCTGCCTCCGGATCGGTGGCTTCTCCTCAGTGGTAGCTCTGGGCGCAAC 

GATCATCTGTAACAAGATCCCAGGCCTGGCTCCCAGACAGCGGGCGATCTGCCAGAGCCGGCC 

CGACGCCATCATCGTCATAGGAGAAGGCTCACAAATGGGCCTGGACGAGTGTCAGTTTCAGTTC 

CGCAATGGCCGCTGGAACTGCTCTGCACTGGGAGAGCGCACCGTCTTCGGGAAGGAGCTCAAA 

GTGGGGAGCCGGGACGGTGCGTTCACCTACGCCATCATTGCCGCCGGCGTGGCCCACGCCATC 

ACAGCTGCCTGTACCCATGGCAACCTGAGCGACTGTGGCTGCGACAAAGAGAAGCAAGGCCAG 

TACCACCGGGACGAGGGCTGGAAGTGGGGTGGCTGCTCTGCCGACATCCGCTACGGCATCGGC 

TTCGCCAAGGTCTTTGTGGATGCCCGGGAGATCAAGCAGAATGCCCGGACTCTCATGAACTTGC 

ACAACAACGAGGCAGGCCGAAAGATCCTGGAGGAGAACATGAAGCTGGAATGTAAGTGCCAC 

GGCGTGTCAGGCTCGTGCACCACCAAGACGTGCTGGACCACACTGCCACAGTTTCGGGAGCTG 

GGCTACGTGCTCAAGGACAAGTACAACGAGGCCGTTCACGTGGAGCCTGTGCGTGCCAGCCGC 

AACAAGCGGCCCACCTTCCTGAAGATCAAGAAGCCACTGTCGTACCGCAAGCCCATGGACACG 

GACCTGGTGTACATCGAGAAGTCGCCCAACTACTGCGAGGAGGACCCGGTGACCGGCAGTGTG 

GGCACCCAGGGCCGCGCCTGCAACAAGACGGCTCCCCAGGCCAGCGGCTGTGACCTCATGTGC 

TGTGGGCGTGGCTACAACACCCACCAGTACGCCCGCGTGTGGCAGTGCAACTGTAAGTTCCACT 

GGTGCTGCTATGTCAAGTGCAACACGTGCAGCGAGCGCACGGAGATGTACACGTGCAAGTGAG 

CCCCGTGTGCACACCACCCTCCCGCTGCAAGTCAGATTGCTGGGAGGACTGGACCGTTTCCAAG 

CTGCGGGCTCCCTGGCAGGATGCTGAGCTTGTCTTTTCTGCTGAGGAAGGTACTITTCCT 

TCCTGCAGGCATCCGTGGGGGAAAAAAAATCTCTCAGAACCCTCAACTATTCTGTTCCACACCC 

AATGCTGCTCCACCCTCCCCCAGACACAGCCCAAGTCCCTCCGCGGCTGGAGCGAAGCCTTCTG 

CAGCAGGAACTCTGGACCCCTGGGCCTCATCACAGCAATATTTAACAATITATTCTGATAAAAA 

TAATATTAATTTATTTAATTAAAAAGAATTCnTCCACCTCAAAAAAA 

AAAAGGGGGG 



WO 03/012082 



PCT/GB02/03409 



Figure 25 21/41 

TCCGCTTACACACCAAGGAAAGTTGGGCTTTGAAGAATTCCATCCCCATGGCCACTGGAGGAA 
GAATATTTCNCCCGTCTTGCTTACCCATCTC 

AGAGGAlTATGTTr(mTCAAAGCCTTCTGTGTACATCTGTCTTTTCACCTGTGTC 

AGCCACAGCTGGTCGGTGAACAATTTCCTGATGACTGGTCCAAAGGCTTACCTGATTTACTCCA 

GCAGTGTGGCAGCTGGTGCCCAGAGTGGTATTGAAGAATGCAAGTATCAGTTTGCCTGGGACC 

GCTGGAACTGCCCTGAGAGAGCCCTGCAGCTGTCCAGCCATGGTGGGCTTCGCAGTGCCAATCG 

GGAGACAGCATTTGTGCATGCCATCAGTTCTGCTGGAGTCATGTACACCCTGACTAGAAACTGC 

AGCCTTGGAGATTTTGATAACTGTGGCTGTGATGACTCCCGCAACGGGCAACTGGGGGGACAA 

GGCTGGCTGTGGGGAGGCTGCAGTGACAATGTGGGCTTCGGAGAGGCGATTrCCAAGCAGTTT 

GTCGATGCCCTGGAAACAGGACAGGATGCACGGGCAGCCATGAACCTGCACAACAACGAGGCT 

GGCCGCAAGGCGGTGAAGGGCACCATGAAACGCACGTGTAAGTGCCATGGCGTGTCTGGCAGC 

TGCACCACGCAGACCTGTTGGCTGCAGCTGCCCGAGTTCCGCGAGGTGGGCGCGCACCTGAAG 

GAGAAGTACCACGCAGCACTCAAGGTGGACCTGCTGCAGGGTGCTGGCAACAGCGCGGCCGCC 

CGCGGCGCCATCGCCGACACCTTTCGCTCCATCTCTACCCGGGAGCTGGTGCACCTGGAGGACT 

CCCCGGACTACTGCCTGGAGAACAAAACGCTAGGGCTGCTGGGCACCGAAGGCCGAGAGTGCC 

TAAGGCGCGGGCGGGCCCTGGGTCGCTGGGAACTCCGCAGCTGCCGCCGGCTCTGCGGGGACT 

GCGGGCTGGCGGTGGAGGAGCGCCGGGCCGAGACCGTGTCCAGCTGCAACTGCAAGTTCCACT 

GGTGCTGTGCAGTCCGCTGCGAGCAGTGCCGCCGGAGGGTCACCAAGTACTTCTGTAGCCGCGC 

AGAGCGGCCGCGGGGGGGCGCTGCGCACAAACCCGGGAGAAAACCCTAAGGGTTTCCTCTGCC 

CCCTCCTITTCCCACTGGTTCTTGGCTTCCTTTAGAGACCCCGGTAATTGTGGAACCTAGGGAAT 

GGGGAACCCGCTCTCCCAGACCTAGGGATCCTGAAAGGGAAAAACTGCAATTTCTCCAAAGCT 

TGCCACT^CCAGCCTGTTTCCCCAArrCCTCTGTGCTCTCCrAAAGCTCTGTCTGAATCCTCGC 

AGCCACACCTAGGTCTGAAAACTCAGGCTTTGAGTTACTGATCTTCCTTGGATTAGGAAAACAG 

GTGTTCCTCCTCCCCTCTCCTATCAGCCCTAATCTCTGACCTAGCCTATCAACCCTTAGGCGCTG 

GAAAAACCTTCTCATACACGCAGGACCCAGGTTAACTCAAAGCTTTGCCCTTTTGCCCACTGTC 

TGCTACCAGGGGCTCACCCTCTGCTGCACCTCTCTTCTGCACAGCTCCTCCCCTGCTACTGCTGA 

CCAAATTCCCAGGAATCTTGAATGCTTTCTCTCCTCTTCTCCCTTTCCTTTCCCAAA^ 

AGGAAACTGGCCCCGGAAAAGCATGTCTTTGGGGTTGGTTCCTAGAGGCAGAGGTTGAAGATG 

GAAGAGGGAGCTCTGGAGTGCTAACTTGAACACCAAGGGTGCTACTCATCCCTATGGTATCATA 

TCATGAATGGACTITACTAGTGGGGCAATGACTTTCCTAGACAATAACCCGAGGGACTCCAGAT 

ACATACCCCGAAGGTCTAGGAAATACGTTAAGGGCAGATTACAGTCATTTCCTACCCTTTAAAG 

GTAACTTCTCCCTTCTCCTGACCTACTTCCTCCTAGCAACCAACTTTACCTCTTCTTCTCCAAAGG 

ATCnTTGTTCCTCTGAGCCAAGACTGAGGTAAATAAAGCCAClTTTCCTCTTC 

CACCTCTAGA 



Figure 26 

GCGGCCGCGTCGACGGAGGGGCTGCAGCTCCGTCAGCCCGGCAGAGCCACCCTGAGCTCGGTG 

AGAGCAAAGCCAGAGCCCCCAGTCCTTTGCTCGCCGGCTTGCTATCTCTCTCGATCACTCCCTCC 

CTTCCTCCCTCCCTTCCTCCCGGCGGCCGCGGCGGCGCTGGGGAAGCGGTGAAGAGGAGTGGCC 

CGGCCCTGGAAGAATGCGGCTCTGACAAGGGGACAGAACCCAGCGCAGTCTCCCCACGGTTrA 

AGCAGCACTAGTGAAGCCCAGGCAACCCAACCGTGCCTGTCTCGGACCCCGCACCCAAACCAC 

TGGAGGTCCTGATCGATCTGCCCACCGGAGCCTCCGGGCTTCGACATGCTGGAGGAGCCCCGGC 

CGCGGCCTCCGCCCTCGGGCCTCGCGGGTCTCCTGTTCCTGGCGTTGTGCAGTCGGGCTCTAAG 

CAATGAGATTCTGGGCCTGAAGTTGCCTGGCGAGCCGCCGCTGACGGCCAACACCGTGTGCTTG 

ACGCTGTCCGGCCTGAGCAAGCGGCAGCTAGACCTGTGCCTGCGCAACCCCGACGTGACGGCG 

TCCGCGCTTCAGGGTCTGCACATCGCGGTCCACGAGTGTCAGCACCAGCTGCGCGACCAGCGCT 

GGAACTGCTCCGCGCTTGAGGGCGGCGGCCGCCTGCCGCACCACAGCGCCATCCTCAAGCGCG 

GTITCCGAGAAAGTGCTTTTTCCTTCTCCATGCTGGCTGCTGGGGTCATGCACGCAGTAGCCAC 

GGCCTGCAGCCTGGGCAAGCTGGTGAGCTGTGGCTGTGGCTGGAAGGGCAGTGGTGAGCAGGA 

TCGGCTGAGGGCCAAACTGCTGCAGCTGCAGGCACTGTCCCGAGGCAAGAGTTTCCCCCACTCT 

CTGCCCAGCCCTGGCCCTGGCTCAAGCCCCAGCCCTGGCCCCCAGGACACATGGGAATGGGGT 

GGCTGTAACCATGACATGGACTITGGAGAGAAGTTCTCTCGGGATTTCTTGGATTCCAGGGAAG 

CTCCCCGGGACATCCAGGCACGAATGCGAATCCACAACAACAGGGTGGGGCGCCAGGTGGTAA 

CTGAAAACCTGAAGCGGAAATGCAAGTGTCATGGCACATCAGGCAGCTGCCAGTTCAAGACAT 
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GCTGGAGGGCGGCCCCAGAGTTCCGGGCAGTGGGGGCGGCGTTGAGGGAGCGGCTGGGCCGG 

GCCATOTCATTGATACCCACAACCGCAATTCTGGAGCCTTCCAGCCCCGTCT^^ 

CCTCTCAGGAGAGCTGGTCT 

GGCTCCCCAGGGACAAGGGGCCGGGCCTGCAACAAGACCAGCCGCCTGTTGGATGGCTGT 

AGCCTGTGCTGTGGCCGTGGGCACAACGTGCTCCGGCAGACACGAGTTGAGCGCT 

GCTTCCACTGGTGCTGCTATGTGCTGTGTGATGAGTGCAAGGTTACAGAGTGGGTGAATGTGTG 

TAAGTGAGGGTCAGCCTTACCTTGGGGCTGGGGAAGAGGACTGTGTGAGAGGGGCGCCTI^C 

AGCCCTTTGCTCTGATTTCCTTCCAA.GGTCACTCTTGGTCCCTGG 

GAAACAGCTTTAGGGGTGGTGGGGGTCAGGTGGACTCTGGGATGTGTAGCCTTCTCCCCAACA 
ATrcGAGGGTCTTGAGGGGAAGCTGCCACCCCTCTTCTGCTCCTTAGACACCTGAATGGACTAA 

GATGAAATGCACTGTATTGCTCCTCCCACTTCTCAACTCCAGAGCCCCTITAAC 
CTCCTTTTGGCTGGGGAGTCCCTATAGTTTCACCACTCCTCTCCCTTGAGGGATAACCCC^GCA 

CTGTITGGAGCCATAAGATCTGTATCTAGAAAGAGATCACCCACTCCTATGTACTAT^ 
CTCCTTTACTGCAGCCTGGGCTCCCTCTTGTGGGATAATGGGAGACAGTGGTAGAGAGG 11 1 1 1 

OTGGGAAAGAGACAGAGTGCTGAGGGGCACTCTCCCCTG^ 

GGCCCTTAGGGAAGTTGTCTCCTTCCATTCAGATGTTAATGGGGACCCTCCAAAGGAAGGGGTT 

TTCCCATGACTCTTGGAGCCTCTTTTTCCTTCTTCAGCAGGAAGGGTGGGAAGGGATAATTT 

ATACTGAGACTTGTTCTTGGTTCCTGTTTGAAACTAAAATAAATTAAGTTACTGGAAAAAAAAA 

AAAAAAAAA 



Figure 27 

TAACCCGCCGCCTCCGCTCTCCCCGGCTGCAGGCGGCGTGCAGGACCAGCGGCGGCCGTGCAG 

GCGGAGGACTTCGGCGCGGCTCCTCCTGGGTGTGACCCCGGGCGCGCCCGCCGCGCGACGATG 

AGGGCGCGGCCGCAGGTCTGCGAGGCGCTGCTCTTCGCCCTGGCGCTCCAGACCGGCGTGTGCT 

ATGGCATCAAGTGGCTGGCGCTGTCCAAGACACCATCGGCCCTGGCACTGAACCAGACGCAAC 

ACTGCAAGCAGCTGGAGGGTCTGGTGTCTGCACAGGTGCAGCTGTGCCGCAGCAACCTGGAGC 

TCATGCACACGGTGGTGCACGCCGCCCGCGAGGTCATGAAGGCCTGTCGCCGGGCCTTTGCCGA 

CATGCGCTGGAACTGCTCCTCCATTGAGCTCGCCCCCAACTATTTGCTTGACCTGGAGAGAGGG 

ACCCGGGAGTCGGCCTTCGTGTATGCGCTGTCGGCCGCCACCATCAGCCACGCCATCGCCCGGG 

CCTGCACCTCCGGCGACCTGCCCGGCTGCTCCTGCGGCCCCGTCCCAGGTGAGCCACCCGGGCC 

CGGGAACCGCTGGGGAAGATGTGCGGACAACCTCAGCTACGGGCTCCTCATGGGGGCCAAGTT 

TTCCGATGCTCCTATGAAGGTGAAAAAAACAGGATCCCAAGCCAATAAACTGATGCGTCTACA 

CAACAGTGAAGTGGGGAGACAGGCTCTGCGCGCCTCTCTGGAAATGAAGTGTAAGTGCCATGG 

GGTGTCTGGCTCCTGCTCCATCCGCACCTGCTGGAAGGGGCTGCAGGAGCTGCAGGATGTGGCT 

GCTGACCTCAAGACCCGATACCTGTCGGCCACCAAGGTAGTGCACCGACCCATGGGCACCCGC 

AAGCACCTGGTGCCCAAGGACCTGGATATCCGGCCTGTGAAGGACTGGGAACTTGTTTATTTGC 

AGAGCTCACCTGACTTTTGCATGAAGAATGAGAAGGTGGGCTCCCACGGGACACAAGACAGGC 

AGTGCAACAAGACTTCCAACGGAAGCGACAGCTGCGACCTTATGTGCTGCGGGCGTGGCTACA 

ACCCCTACACAGACCGCGTGGTCGAGCGGTGCCACTGTAAGTACCACTGGTGTTGCTACGTCAC 

CTGCCGCAGGTGTGAGCGTACCGTGGAGCGCTATGTCTGCAAGTGAGGCCCTGCCCTCCGCCCC 

ACGCAGGAGCGAGGACTTTGCTCAAGGACCCTCAGCAACTGGGGCCGGGGGCCTGGAGACACT 

CCATGGAGCTCTGCTrGTGAATTCCAGATGCCAGGCATGGGAGGCGGCTTGTGCTTTGCCTTCA 

CTTGGAAGCCACCAGGAACAGAAGGTCTGGCCACCCTGGAAGGAGNGCAGGACATCAAAGGA 

AACCGACAAGATTAAAAATAACITGGCAGCCTGAGNTCTGGAGTGCCCACAGNNTGGTGTAAG 

GAGCGGGGCTTGGGATCGGTGAGACTGATACAGACTTGACCTTTCAGGGCCACAGAGACCAGC 

CTCCGGGAAGGGGTCTGCCCGCCTTCTTCAGAATGTTCTGCGGGACCCCCTGGCCCACCCTGGG 

GTCTGAGCCTGCTGGGCCACCACATGGAATCACTAGCTTCGGGTTGTAAATGTITrCTTTTGTTT 

NTTGCTTTITCTTCCTTTGGGATGTTGGAAGCTACAGAAATATTTATAAA^ 
TGGGGTGGCACTTCTCAATTCCTCTTTATATATTTTANATATATAAATATATATGTATATATATA 
ATGATCTCTAATOTAAAACTAGCTTTTTAAGCAGCTGTATGAAATAAATGCTGAGTGAGCCCCA 
GCCCGCCCCTGCAGTTCCCGGCCTCGTCAAGTGAACTCGGCAGACCCTGGGGCTGGCAGAGGG 

AGCTCTCCAGTTTCCGGGCA 
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GGCGCGGCAAGATGCTGGATGGGTCCCCGCTGGCGCGCTGGCTGGCCGCGGCCTTCGGGCTGA 

CGCTGCTGCTCGCCGCGCTGCGCCCTTCGGCCGCCTACTTCGGGCTGACGGGCAGCGAGCCCCT 

GACCATCCTCCCGCTGACCCTGGAGCCAGAGGCGGCCGCCCAGGCGCACTACAAGGCCTGCGA 

CCGGCTGAAGCTGGAGCGGAAGCAGCGGCGCATGTGCCGCCGGGACCCGGGCGTGGCAGAGA 

CGCTGGTGGAGGCCGTGAGCATGAGTGCGCTCGAGTGCCAGTTCCAGTTCCGCTTTGAGCGCTG 

GAACTGCACGCTGGAGGGCCGCTACCGGGCCAGCCTGCTCAAGCGAGGCTTCAAGGAGACTGC 

CTTCCTCTATGCCATCTCCTCGGCTGGCCTGACGCACGCACTGGCCAAGGCGTGCAGCGCGGGC 

CGCATGGAGCGCTGTACCTGCGATGAGGCACCCGACCTGGAGAACCGTGAGGCCTGGCAGTGG 

GGGGGCTGCGGAGACAACCTTAAGTACAGCAGCAAGTTCGTCAAGGAATTCCTGGGCAGAGGG 

TCAAGCAAGGATCTGCGAGCCCGTGTGGACTTCCACAACAACCTCGTGGGTGTGAAGGTGATC 

AAGGCTGGGGTGGAGACCACCTGCAAGTGCCACGGCGTGTCAGGCTCATGCACGGTGCGGACC 

TGCTGGCGGCAGTTGGCGCCTTTCCATGAGGTGGGCAAGCATCTGAAGCACAAGTATGAGACG 

GCACTCAAGGTGGGCAGCACCACCAATGAAGCTGCCGGCGAGGCAGGTGCCATCTCCCCACCA 

CGGGGCCGTGCCTCGGGGGCAGGTGGCAGCGACCCGCTGCCCCGCACTCCAGAGCTGGTGCAC 

CTGGATGACTCGCCTAGCTTCTGCCTGGCTGGCCGCTTCTCCCCGGGCACCGCTGGCCGTAGGT 

GCCACCGTGAGAAGAACTGCGAGAGCATCTGCTGTGGCCGCGGCCATAACACACAGAGCCGGG 

TGGTGACAAGGCCCTGCCAGTGCCAGGTGCGTTGGTGCTGCTATGTGGAGTGCAGGCAGTGCA 

CGCAGCGTGAGGAGGTCTACACCTGCAAGGGCTGAGTTCCCAGGCCCTGCCAGCCCTGCTGCA 

CAGGGTGCAGGCATTGCACACGGTGTGAAGGGTCTACACCTGCACAGGCTGAGTTCCTGGGCT 

CGACCAGCCCAGCTGCGTGGGGTACAGGCATTGCACACAGTGTGAATGGGTCTACACCTGCAT 

GGGCTGAGTCCCTGGGCTCAGACCTAGCAGCGTGGGGTAGTCCCTGGGCTCAGTCCTAGCTGCA 

TGGGGTGCAGGCATTGCACAGAGCATGAATGGGCCTACACCTGCCAAGGCTGAATCCCTGGGC 

CCAGCCAGCCCTGCTGCACATGGCACAGGCATTGCACACGGTGTGAGGAGTGTACACCTGCAA 

GGGCTGAGGCCCTGGGCCCAGTCAGCCCTGCTGCTCAGAGTGCAGGCATTGCACATGGTGTGA 

GAAGGTCTACACCTGCAAGGGACGAGTCCCCGGGCCTGGCCAACCCTGCTGTGCAGGGTGAGG 

GCCATGCATGCTAGTATGAGGGGTCTACACCTGCAAGGACTGAGAGGCTTTT 



Figure 29 

AGCCTGCAAAAACCACAGAGGGCAAAGCCAGAAAGATGGAAAGGCACCCACCCATGCAGCTC 

ACCACTTGCCTCAGGGAGACCCTCTrCACAGGGGCTTCTCAAAAGACCTCCCTATGGTGGTTGG 

GCATTGCCTCCTTCGGGGTTCCAGAGAAGCTGGGCTGCGCCAATTTGCCGCTGAACAGCCGCCA 

GAAGGAGCTGTGCAAGAGGAAACCGTACCTGCTGCCGAGCATCCGAGAGGGCGCCCGGCTGGG 

CATTCAGGAGTGCAGGAGCCAGTTCAGACACGAGAGATGGAACTGCATGATCACCGCCGCCGC 

CACTACCGCCCCGATGGGCGCCAGCCCCCTCTTTGGCTACGAGCTGAGCAGCGGCACCAAAGA 

GACAGCATTTATTTATGCTGTGATGGCTGCAGGCCTGGTGCATTCTGTGACCAGGTCATGCAGT 

GCAGGCAACATGACAGAGTGTTCCTGTGACACCACCTTGCAGAACGGCGGCTCAGCAAGTGAA 

GGCTGGCACTGGGGGGGCTGCTCCGATGATGTCCAGTATGGCATGTGGTTCAGCAGAAAGTTCC 

TAGATTrCCCCATCGGAAACACCACGGGCAAAGAAAACAAAGTACTATTAGCAATGAACCTAC 

ATAACAATGAAGCTGGAAGGCAGGCTGTCGCCAAGTTGATGTCAGTAGACTGCCGCTGCCACG 

GAGTTTCCGGCTCCTGTGCTGTGAAAACATGCTGGAAAACCATGTCITTCTTITGAAAAGATTGG 

CCATTTGTTGAAGGATAAATATGAAAACAGTATCCAGATATCAGACAAAATAAAGAGGAAAAT 

GCGCAGGAGAGAAAAAGATCAGAGGAAAATACCAATCCATAAGGATGATCTGCTCTATGTTAA 

TAAGTCTCCCAACTACTGTGTAGAAGATAAGAAACTGGGAATCCCAGGGACACAAGGCAGAGA 

ATGCAACCGTACATCAGAGGGTGCAGATGGCTGCAACCTCCTCTGCTGTGGCCGAGGTTACAAC 

ACCCATGTGGTCAGGCACGTGGAGAGGTGTGAGTGTAAGTTCATCTGGTGCTGCTATGTCCGTT 

GCAGGAGGTGTGAAAGCATGACTGATGTCCACACTTGCAAGTAACCACTCCATCCAGCCTTGG 

GCAAGATGCCTCAGCAATATACAATGGCATTGCAACCAGAGAGGTGCCCATCCCTGTGCAGCG 

CTAGTAAAGTTGACTCTTGCAGTGGAATCCC 
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AGTTGAGGGATTGACACAAATGGTCAGGCGGCGGCGGCGGAGAAGGAGGCGGAGGCGCAGGG 

GGGAGCCGAGCCCGCTGGGCTGCGGAGAGTTGGGCTCTCTACGGGGCCGCGGCCACTAGCGCG 

GCGCCGCCAGCCGGGAGCCAGCGAGCCGAGGGCCAGGAAGGCGGGACACGACCCCGGCGCGC 

CCTAGCCACCCGGGTTCTCCCCGCCGCCCGCGCTTCATGAATCGCAAGTTTCCGCGGCGGCGGC 

GGCTGCGGTACGCAGAACAGGAGCCGGGGGAGCGGGCCGAAAGCGGCTTGGGCTCGACGGAG 

GGCACCCGCGCAGAGGTCTCCCTGGCCGCAGGGGGAGCCGCCGCCGGCCGTGCCCCTGGCAGC 

CCCAGCGGAGCGGCGCCAAGAGAGGAGCCGAGAAAGTATGGCTGAGGAGGAGGCGCCTAAGA 

AGTCCCGGGCCGCCGGCGGTGGCGCGAGCTGGGAACTTTGTGCCGGGGCGCTCTCGGCCCGGC 

TGGCGGAGGAGGGCAGCGGGGACGCCGGTGGCCGCCGCCGCCCGCCAGTTGACCCCCGGCGAT 

TGGCGCGCCAGCTGCTGCTGCTGCTTTGGCTGCTGGAGGCTCCGCTGCTGCTGGGGGTCCGGGC 

CCAGGCGGCGGGCCAGGGGCCAGGCCAGGGGCCCGGGCCGGGGCAGCAACCGCCGCCGCCGC 

CTCAGCAGCAACAGAGCGGGCAGCAGTACAACGGCGAGCGGGGCATCTCCGTCCCGGACCACG 

GCTATTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCGTACAACCAGACCATCATGCC 

CAACCTGCTGGGCCACACGAACCAGGAGGACGCGGGCCTGGAGGTGCACCAGTTCTACCCTCT 

AGTGAAAGTGCAGTGTTCCGCTGAGCTCAAGTTCTTCCTGTGCTCCATGTACGCGCCCGTGTGC 

ACCGTGCTAGAGCAGGCGCTGCCGCCCTGCCGCTCCCTGTGCGAGCGCGCGCGCCAGGGCTGC 

GAGGCGCTCATGAACAAGTTCGGCTTCCAGTGGCCAGACACGCTCAAGTGTGAGAAGTTCCCG 

GTGCACGGCGCCGGCGAGCTGTGCGTGGGCCAGAACACGTCCGACAAGGGCACCCCGACGCCC 

TCGCTGCTTCCAGAGTTCTGGACCAGCAACCCTCAGCACGGCGGCGGAGGGCACCGTGGCGGC 

TTCCCGGGGGGCGCCGGCGCGTCGGAGCGAGGCAAGTTCTCCTGCCCGCGCGCCCTCAAGGTG 

CCCTCCTACCTCAACTACCACTTCCTGGGGGAGAAGGACTGCGGCGCACCTTGTGAGCCGACCA 

AGGTGTATGGGCTCATGTACTTCGGGCCCGAGGAGCTGCGCTTCTCGCGCACCTGGATTGGCAT 

TTGGTCAGTGCTGTGCTGCGCCTCCACGCTCTTCACGGTGCTTACGTACCTGGTGGACATGCGG 

CGCTTCAGCTACCCGGAGCGGCCCATCATCTTCTTGTCCGGCTGTTACACGGCCGTGGCCGTGG 

CCTACATCGCCGGCTTCCTCCTGGAAGACCGAGTGGTGTGTAATGACAAGTTCGCCGAGGACGG 

GGCACGCACTGTGGCGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTCTTCATGATGCTCTA 

CTrCTTCAGCATGGCCAGCTCCATCTGGTGGGTGATCCTGTCGCTCACCTGGTTCCTGGCGGCTG 

GCATGAAGTGGGGCCACGAGGCCATCGAAGCCAACTCACAGTATTTTCACCTGGCCGCCTGGG 

CTGTGCCGGCCATCAAGACCATCACCATCCTGGCGCTGGGCCAGGTGGACGGCGATGTGCTGA 

GCGGAGTGTGCTTCGTGGGGCTTAACAACGTGGACGCGCTGCGTGGCTTCGTGCTGGCGCCCCT 

CTTCGTGTACCTGTTTATCGGCACGTCCTTTCTGCTGGCCGGCTTTGTGTCGCTCrTCCGCATCCG 

CACCATCATGAAGCACGATGGCACCAAGACCGAGAAGCTGGAGAAGCTCATGGTGCGCATTGG 

CGTCrrCAGCGTGCTGTACACTGTGCCAGCCACCATCGTCATCGCCTGCTACTTCTACGAGCAG 

GCCTTCCGGGACCAGTGGGAACGCAGCTGGGTGGCCCAGAGCTGCAAGAGCTACGCTATCCCC 

TGCCCTCACCTCCAGGCGGGCGGAGGCGCCCCGCCGCACCCGCCCATGAGCCCGGACTTCACG 

GTCTTCATGATTAAGTACCTTATGACGCTGATCGTGGGCATCACGTCGGGCTTCTGGATCTGGTC 

CGGCAAGACCCTCAACTCCTGGAGGAAGTTCTACACGAGGCTCACCAACAGCAAACAAGGGGA 

GACTACAGTCTGAGACCCGGGGCTCAGCCCATGCCCAGGCCTCGGCCGGGGCGCAGCGATCCC 

CCAAAGCCAGCGCCGTGGAGTTCGTGCCAATCCTGACATCTCGAGGTTTCCTCACTAGACAACT 

CTCTITCGCAGGCTCCTTTGAACAACTCAGCTCCTGCAAAAGCTTCCGTCCCTGAGGCAAAAGG 

ACACGAGGGCCCGACTGCCAGAGGGAGGATGGACAGACCTCTTGCCCTCACACTCTGGTACCA 

GGACTGTTCGCTTITATGATTGTAAATAGCCTGTGTAAGATTTTTGTAAGTATATTTGTAm 

atgacgaccgatcacgcgtttitctttitcaaaagtttitaa™ 

TGAGGCTITrCCTrCTTGCCCTTTTCGGAGTATTGCAAAGGAGCTAAAACTGGTGTGCAACCGC 

acagcgctcctggtcgtcctcgcgcgcctctccctaccacgggtgctcgggacggctgggcgcc 

AGCTCCGGGGCGAGTTCAGCACTGCGGGGTGCGACTAGGGCTGCGCTGCCAGGGTCACTTCCC 

GCCTCCTCCTTITGCCCCCTCCCCCrCCTrCTGTCCCCTCCCTITCTTTCCTGGCTTGAGGTAGGG 

GCTCTTAAGGTACAGAACTCCACAAACCTTCCAAATCTGGAGGAGGGCCCCCATACATTACAAT 

TCCTCCCTTGCTCGGCGGTGGATTGCGAAGGCCCGTCCCTTCGACTTCCTGAAGCTGGATTnTA 

ACTGTCCAGAACTTTCCTCCAACTTCATGGGGGCCCACGGGTGTGGGCGCTGGCAGTCTCAGCC 

TCCCTCCACGGTCACCTTCAACGCCCAGACACTCCCTTCTCCCACCTTAGTTGGTTACAGGGTGA 

GTGAGATAACCAATGCCAAACTTTTTGAAGTCTAATTTTTGAGGGGTGAGCTCATTT 

AGTGTCTAAAACCTGGTATGGGTTTGGCCAGCGTCATGGAAAGATGTGGTTACTGAGATTTGGG 

AAGAAGCATGAAGCTTTGTGTGGGTTGGAAGAGACTGAAGATATGGGTTATAAAATGTTAATT 

CTAATTGCATACGGATGCCTGGCAACCTTGCCTTTGAGAATGAGACAGCCTGCGCTTAGATTTT 

ACCGGTCTGTAAAATGGAAATGTTGAGGTCACCTGGAAAGCTTTGTTAAGGAGTTGATGTTTGC 
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TTTCCHTAACAAGACAGCAAAACGTA^ 
GGACTrCCTCAAAATGAAGTGCTATTTTCTTATTTTTAAT^ 

A(mTAAAATGTAAAAGTTGTACACTITCAACAlTITATTACGATTATTATTCAGCAGCACATTC 

TGAGGGGGGAACAATTCACACCACCAATAATAACCTGGTAAGATTTCAGGAGGTAAAGAAGGT 

GGAATAATTGACGGGGAGATAGCGCCTGAAATAAACAAAATATGGGCATGCATGCTAAAGGG 

AAAATGTGTGCAGGTCTACTGCATTAAATCCTGTGTGCTCCTCTTTTGGATTTACAGAAATGTGT 

CAAATGTAAATCTTTCAAAGCCATTTAAAAATATTCACTTTAGTTCT 

AAAGCAATCCTCCTGATTGTATTGTTTTAAACTTTAAGAATTTATCAAAATGCCGGTAOT 

ACCTAAATTTATCTATGTCTGTCATACGCTAAAATGATATTGGTCTTTGAATTTGGTATACAT^ 

ATTCTGTTCACTATCACAAAATCATCTATATTTATAGAGGAATAGAAGTTTATATATATATAATA 

CCATATTTTTAATTTCACAAATAAAAAATTCAAAGTTTTGTACAAAATTATATGGAT^ 

TGAAAATAATAGAGCTTGAGCTGTCTGAACTATTITACATTTTATGGTGTCTCATAGCCAATCCC 

ACAGTGTAAAAATTCA 



Figure 31 

CGAGTAAAGTTTGCAAAGAGGCGCGGGAGGCGGCAGCCGCAGCGAGGAGGCGGCGGGGAAGA 

AGCGCAGTCTCCGGGTTGGGGGCGGGGGCGGGGGGGGCGCCAAGGAGCCGGGTGGGGGGCGG 

CGGCCAGCATGCGGCCCCGCAGCGCCCTGCCCCGCCTGCTGCTGCCGCTGCTGCTGCTGCCCGC 

CGCCGGGCCGGCCCAGTTCCACGGGGAGAAGGGCATCTCCATCCCGGACCACGGCTTCTGCCA 

GCCCATCTCCATCCCGCTGTGCACGGACATCGCCTACAACCAGACCATCATGCCCAACCTTCTG 

GGCCACACGAACCAGGAGGACGCAGGCCTAGAGGTGCACCAGTTCTATCCGCTGGTGAAGGTG 

CAGTGCTCGCCCGAACTGCGCTTCTTCCTGTGCTCCATGTACGCACCCGTGTGCACCGTGCTGG 

AACAGGCCATCCCGCCGTGCCGCTCTATCTGTGAGCGCGCGCGCCAGGGCTGCGAAGCCCTCAT 

GAACAAGTTCGGTTTTCAGTGGCCCGAGCGCCTGCGCTGCGAGCACTTCCCGCGCCACGGCGCC 

GAGCAGATCTGCGTCGGCCAGAACCACTCCGAGGACGGAGCTCCCGCGCTACTCACCACCGCG 

CCGCCGCCGGGACTGCAGCCGGGTGCCGGGGGCACCCCGGGTGGCCCGGGCGGCGGCGGCGCT 

CCCCCGCGCTACGCCACGCTGGAGCACCCCTTCCACTGCCCGCGCGTCCTCAAGGTGCCATCCT 

ATCTCAGCTACAAGTTTCTGGGCGAGCGTGATTGTGCTGCGCCCTGCGAACCTGCGCGGCCCGA 

TGGTTCCATGTTCTTCrCACAGGAGGAGACGCGTTTCGCGCGCCTCTGGATCCTCACCTGGTCG 

GTGCTGTGCTGCGCTTCCACCTTCTTCACTGTCACCACGTACTTGGTAGACATGCAGCGCTTCCG 

CTACCCAGAGCGGCCTATCATTTTTCTGTCGGGCTGCTACACCATGGTGTCGGTGGCCTACATC 

GCGGGCTTCGTGCTCCAGGAGCGCGTGGTGTGCAACGAGCGCTTCTCCGAGGACGGTTACCGC 

ACGGTGGTGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTCTTCATGATGCTCTACTTCTTCA 

GCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCTGGTTCCTGGCAGCCGGCATGAA 

GTGGGGCCACGAGGCCATCGAGGCCAACTCTCAGTACTTCCACCTGGCCGCCTGGGCCGTGCCG 

GCCGTCAAGACCATCACCATCCTGGCCATGGGCCAGATCGACGGCGACCTGCTGAGCGGCGTG 

TGCTTCGTAGGCCrCAACAGCCTGGACCCGCTGCGGGGCITCGTGCTAGCGCCGCTCrrTCGTGT 

ACCTGTTCATCGGCACGTCCTTCCTCCTGGCCGGCTTCGTGTCGCTCTTCCGCATCCGCACCATC 

ATGAAGCACGACGGCACCAAGACCGAAAAGCTGGAGCGGCTCATGGTGCGCATCGGCGTCTTC 

TCCGTGCTCTACACAGTGCCCGCCACCATCGTCATCGCITGCTACTTCrrACGAGCAGGCCTTCCG 

CGAGCACTGGGAGCGCTCGTGGGTGAGCCAGCACTGCAAGAGCCTGGCCATCCCGTGCCCGGC 

GCACTACACGCCGCGCATGTCGCCCGACTTCACGGTCTACATGATCAAATACCTCATGACGCTC 

ATCGTGGGCATCACGTCGGGCTTCTGGATCTGGTCGGGCAAGACGCTGCACTCGTGGAGGAAG 

TTCTACACTCGCCTCACCAACAGCCGACACGGTGAGACCACCGTGTGAGGGACGCCCCCAGGC 

CGGAACCGCGCGGCGCTTTCCTCCGCCCGGGGTGGGGCCCCTACAGACTCCGTATTTTATTITTT 

TAAATAAAAAACGATCGAAACCATTTCACTTITAGGTTGCTTm 

AACACCCCC 

Figure 32 

GCCGCTCCGGGTACCTGAGGGACGCGCGGCCGCCCGCGGCAGGCGGTGCAGCCCCCCCCCACC 
CCTTGGAGCCAGGCGCCGGGGTCTGAGGATAGCATTTCTCAAGACCTGACTTATGGAGCACTTG 
TAACCTGAGATATTTCAGTTGAAGGAAGAAATAGCTCTTCTCCTAAGATGGAATCTGTGGTTT 
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GGAATGTGGTTGATCAACTTGATATGTTGGCCAAATGTGCCCCATGTAAT AAAA TGAAAAGAA 

GAGACAAGATGATGTCATTTTCCCATATTGTGAAACCAAAAACAAACGCCrnTGTGAGACCAA 

GCTAACAAACCTCTGACGGTGCGAAGAGTATTTAACTGTTTGAAGAATTTAACAGTAAGATACA 

GAAGAAGTACCTrCGAGCTGAGACCTGCAGGTGTATAAATATCTAAAATACATATTGAATAGG 

CCTGATCATCTGAATCTCCTTCAGACCCAGGAAGGATGGCTATGACITGGATTGTCTTCTCTCnT 

TGGCCCTTGACTGTGTTCATGGGGCATATAGGTGGGCACAGTTTGTTTTCTTGTGAACCTATTAC 

CTTGAGGATGTGCCAAGATTTGCCTTATAATACT 

ACCAACAGACAGCAGCTTTGGCAATGGAGCCATTCCACCCTATGGTGAATCTGGATTGTTCTCG 

GGATTTCCGGCCTTTTCTTTGTGCACTCTACGCTCCTATTTGTATGGAATATG 

TTCCCTGTCGTAGGCTGTGTCAGCGGGCTTACAGTGAGTGTTCGAAGCTCATGGAGATGTTTGG 

TGTTCCTTGGCCTGAAGATATGGAATGCAGTAGGTTCCCAGATTGTGATGAGCCATATCCTCGA 

CTTGTGGATCTGAATTTAGCTGGAGAACCAACTGAAGGAGCCCCAGTGGCAGTGCAGAGAGAC 

TATGGTTTTTGGTGTCCCCGAGAGTTAAAAATTGATCCTGATCTGGGTTATTCTITTCTGCATGT 

GCGTGATTGTTCACCTCCTTGTCCAAATATGTACTTCAGAAGAGAAGAACTGTCAT TTGCTCG CT 

ATTTCATAGGATTGATTTCAATCATTTGCCrCTCGGCCACATTGTITACTTT^ 

TTGATGTCACAAGATTCCGTTATCCTGAAAGGCCTATTATATTTTATGCAGTCTGCTACATGATG 

GTATCCTTAATTTTCTTCATTGGATTTTTC 

TGCACAATATAAGGCITCCACAGTGACACAAGGATCTCATAATAAAGCCTGTACCATGCITrTT 

ATGATACTCTATTTTTTTACTATGGCTGGCAGTGTATGGTGGGTAATTCTTACCATCACATGG^ 

TTTAGCAGCTGTGCCAAAGTGGGGTAGTGAAGCTATTGAGAAGAAAGCATTGCTGTTTCACGCC 

AGTGCATGGGGCATCCCCGGAACTCTAACCATCATCCTTTTAGCGATGAATAAAATTGAAGGTG 

ACAATATTAGTGGCGTGTGTTTTGTTGGCCTCTACGATGTTGATGCATTGAGATATTTTGTrcnT 

GCTCCCCTCTGCCTGTATGTGGTAGTTGGGGTTTCTCTC 

CAGAGTTCGAATTGAGATTCCATTAGAAAAGGAGAACCAAGATAAATTAGTGAAGTTTATGAT 
CCGGATCGGTGTTTTCAGCATTCmATCTCGT^^ 

TGAGCAAGCTTACCGGGGCATCTGGGAAACAACGTGGATACAAGAACGCTGCAGAGAATATCA 

CATTCCATGTCCATATCAGGTTACTCAAATGAGTCGTCCAGACTrGATTCrCTTTCTGATGAAAT 

ACCTGATGGCrCTCATAGTTGGCATTCCCTCTGTATTTTGGGTTGGAAGCAAAAAGACATGC^ 

GAATGGGCCAGTTTTTTTCATGGTCGTAGGAAAAAAGAGATAGTGAATGAGAGCCGACAGGTA 

CTCCAGGAACCTGATTTTGCTCAGTCTCTCCTGAGGGATCCAAATACTCCT 

CAAGGGGAACTTCCACTCAAGGAACATCCACCCATGCnTCITCAACTCAGCTGGCTATGGTGGA 

TGATCAAAGAAGCAAAGCAGGAAGCATCCACAGCAAAGTGAGCAGCTACCACGGCAGCCTCC 

ACAGATCACGTGATGGCAGGTACACGCCCTGCAGTTACAGAGGAATGGAGGAGAGACTACCTC 

ATGGCAGCATGTCACGACTAACAGATCACTCCAGGCATAGTAGTTCTCATCGGCTCAATGAACA 

GTCACGACATAGCAGCATCAGAGATCTCAGTAATAATCCCATGACTCATATCACACATGGCACC 

AGCATGAATCGGGTTATTGAAGAAGATGGAACCAGTGCTTAATTTGTCTTGTCTAAGGTGGAAA 

TCTTGTGCTGTTTAAAAAGCAGATTTTATTCTTTGCCTTTTGCAT^ 

GTTAACATGCTTTCAGTCAAGTACAGATTGTGTCCACTGGAAAGGTAAATGATTGCTrilllATA 
TTGCATCAAACTTGGAACATCAAGGCATCCAAAACACTAAGAATTCTATCATCACAAAAATAAT 
TCGTCTTTCTAGGTTATGAAGAGATAATTATTTGTCTGGTAAGCATTTTTATAAACCCACTCA^ 
TTATATTTAGAAAAATCCTAAATGTGTGGTGACT^ 

TAGTTGTGAGATAACATTCTGGTAGCTCAGITAATAAAACAATTTCAGAATTAAAGAAATTTTC 

TATGCAAGGTTTACTTCTCAGATGAACAGTAGGACTTTGTAGTTTTATTTCCACT 

AAGAACTGTGTTTTTAAACTGTAGGAGAATTTAATAAATCAGCAAGGGTATTTTAGCTAATAGA 

ATAAAAGTGCAACAGAAGAATTTGATTAGTCTATGAAAGGTTCTCTTAAAATTCTATCGAAATA 

ATCTTCATGCAGAGATATTCAGGGTTTGGATTAGCAGTGGAATAAAGAGATGGGCATTGTTTCC 

CCTATAATTGTGCTGTTTTTATAACIT^ 

ATCCATATGCATGATGGAAAAATTTTAATITGTAGCCAT<mTrCCCATGTAATAGTATTGATTC 
ATAGAGAACTTAATGTTCAAAATTTGCTTTGTGGAGGCATGTAATAAGATAAACATCATACATT 
ATAAGGTAACCACAATTACAAAATGGCAAAACA 

Figure 33 

GCTGCGCAGCGCTGGCTGCTGGCTGGCCTCGCGGAGACGCCGAACGGACGCGGCCGGCGCCGG 
CTTGTGGGCTCGCCGCCTGCAGCCATGACCCTCGCAGCCTGTCCCTCGGCCTCGGCCCGGGACG 
TCTAAAATCCCACACAGTCGCGCGCAGCTGCTGGAGAGCCGGCCGCTGCCCCCTCGTCGCCGCA 



WO 03/012082 PCT/GB02/03409 



27/41 



TCACACTCCCGTCCCGGGAGCTGGGAGCAGCGCGGGCAGCCGGCGCCCCCGTGCAAACTGGGG 
GTGTCTGCCAGAGCAGCCCCAGCCGCTGCCGCTGCTACCCCCGATGCTGGCCATGGCCTGGCGG 
GGCGCAGGGCCGAGCGTCCCGGGGGCGCCCGGGGGCGTCGGTCTCAGTCTGGGGTTGCTCCTG 
CAGTTGCTGCTGCTCCTGGGGCCGGCGCGGGGCTTCGGGGACGAGGAAGAGCGGCGCTGCGAC 
CCCATCCGCATCTCCATGTGCCAGAACCTCGGCTACAACGTGACCAAGATGCCCAACCTGGTTG 
GGCACGAGCTGCAGACGGACGCCGAGCTGCAGCTGACAACTTTCACACCGCTCATCCAGTACG 



GCTGCTCCAGCCAGCTGCAGTTOTCCTITGTTCTGtrTATGTGCCAATGTGCACAGAGAAGATC 
AACATCCCCATTGGCCCATGCGGCGGCATGTGTCTTTCAGTCAAGAGACGCTGTGAACCCGTCC 
TGAAGGAATTTGGATTTGCCTGGCCAGAGAGTCTGAACTGCAGCAAATTCCCACCACAGAACG 
ACCACAACCACATGTGCATGGAAGGGCCAGGTGATGAAGAGGTGCCCTTACCTCACAAAACCC 
CCATCCAGCCTGGGGAAGAGTGTCACTCTGTGGGAACCAATTCTGATCAGTACATCTGGGTGAA 
AAGGAGCCTGAACTGTGTGCTCAAGTGTGGCTATGATGCTGGCTTATACAGCCGCTCAGCCAAG 
^ , ^r^^y ^rrr^ « t- a T/^Tr'n a ^^/rl/^r^r;Tr^T^;^^;^'^ , A ^;^'^^^iT^lTTT^ , A TrTTC.n A(TTGCCTTC ACAGT 



MuUAULUluAALlUllilU^10AAUlumu^iAJiun.iu^iuuvi 

GAGTTCACTGATATCTGGATGGCTGTGTGGGCCAGCCTGTGTTTCATCTCCACTGCCTTCACAGT 
ACTGACCTTCCTGATCGATTCnTCTAGGTTTTCCTACCCTGAGCGCCCCATCATATTTCTCAGTA 
TGTGCTATAATATTTATAGCATTGCTTATATTGTCAGGCTGACTGTAGGCCGGGAAAGGATATC 
CTGTGATTTTGAAGAGGCAGCAGAACCTGTTCTCATCCAAGAAGGACTTAAGAACACAGGATG 
TGCAATAATmCTTGCTGATGTACTITmGGAATGGCCAGCTCCATTTGGTGGGTTATTCTGA 
CACTCACTTGGTTTTTGGCAGCAGGACTCAAATGGGGTCATGAAGCCATTGAAATGCACAGCTC 
TrATTTCCACATTGCAGCCTGGGCCATCCCCGCAGTGAAAACCATTGTCATCTTGATTATGAGA 
CTGGTGGATGCAGATGAACTGACTGGCTTGTGCTATGTTGGAAACCAAAATCTCGATGCCCTCA 
CCGGGTTCGTGGTGGCTCCCCTCITrACTTATTTGGTCATTGGAA(mTGTTCATTGCTGCAGGT 
TTGGTGGCCTTGTTCAAAATTCGGTCAAATCTrCAAAAGGATGGGACAAAGACAGACAAGTTA 
GAAAGACTGATGGTCAAGATTGGGGTGTTCTCAGTACTGTACACAGTTCCTGCAACGTGTGTGA 
TTGCCTGTTATTTTTATGAAATCTCCAACTGGGCACTITITCGGTATTCT 

ATGGCTGTTGAAATGTTGAAAACTTTTATGTCTTTGTTGGTGGGCATCACTTCAGGCATGTGGAT 
TTGGTCTGCCAAAAGTCITCACACGTGGCAGAAGTGTTCCAACAGATTGGTGAATTCTGGAAAG 
GTAAAGAGAGAGAAGAGAGGAAATGGTTGGGTGAAGCCTGGAAAAGGCAGTGAGACTGTGGT 
ATAAGGCTAGTCAGCCTCCATGCTITCTrCATTTrGAAGGGGGGAATGCCAGCATTTTGGAGGA 



Al AAUUL. 1 AU I U AUt^L. 1 lU^l llbl iiiunnuuuuuvj^i^^^^w - 

AATTCTACTAAAAGTTTTATGCAGTGAATCTCAGTTTGAACAAACTAGCAACAATTAAGTGACC 
CCCGTCAACCCACTGCCTCCCACCCCGACCCCAGCATCAAAAAACCAATGATTTTGCTGCAGAC 
TITGGAATGATCCAAAATGGAAAAGCCAGTTAGAGGCTTTCAAAGCTGTGAAAAATCAAAACG 
TTGATCACTTTAGCAGGTTGCAGCTTGGAGCGTGGAGGTCCTGCCTAGATTCCAGGAAGTCCAG 
GGCGATACTGTTTTCCCCTGCAGGGTGGGATTTGAGCTGTGAGTTGGTAACTAGCAGGGAGAAA 
TATTAACTTTTTTAACCCTTTACCATTTTAAATACTAACTGGGTCTTr 

TATAAACACTGGAAACGCTGGGTTCAGAAAAGTGTTACAAGAGTTTTATAGTTTGGCTGATGTA 

ACATAAACATCTTCrGTGGTGCGCTGTCTGCTGTrTAGAACTTTGTGGACTGCACTCCCAAGAA 

GTGGTGTTAGAATCnTTCAGTGCCTTTGTCATAAAACAGTTATTTGAACAAACAAAAGTACTGT 

ACTCACACACATAAGGTATCCAGTGGATTTTTCTTCTCTGT^ 

CTCTTCTTGGCTGCTGCTGTTTTCTTCATTTTATC^ 

TTITrGTACTGCAGCATGCTTAAAGAGGGGAAAAGGAAGGGTGATTCACTTTCTGACAATCACT 
TAATTCAGAGGAAAATGAGATTTACTAAGTTGACTTACCTGACGGACCCCAGAGACCTATTGCA 
TrGAGCAGTGGGGACTTAATATATTTTACTTGTGTGATTGCATCTATGCAGACGCCAGTCTGGA 
AGAGCTGAAATGTTAAGTITCnTGGCAACITTGCATT 

TGTCAATTACAATTAAAAGCACATTGTTGGACCATGACATAGTATACTCAACTGACTTTAAAAC 

TATGGTCAACTTCAACTTGCATTCTCAGAATGATAGTGCCITrAAAATTTTm 

CATAAGAATGTTATCAGAATCTGGTCTACTrAGGACAATGGAGACTTmCAGTTrTATAAAGG 

GAACTGAGGACAGCTAATCCAACTACTTGGTGCTGTAATTGTTTCCTAGTAATTGGCAAAGGCT 

CCTTGTAAGATITCACTGGAGGCAGTGTGGCCTGGAGTATTrATATGGTGCTTAATGAATCTCC 

AGAATGCCAGCCAGAAGCCTGATTGGTTAGTAGGGAATAAAGTGTAGACCATATGAAATGAAC 

TGCAAACTCTAATAGCCCAGGTCTTAATTGCCTTTAGCAGAGGTATCCAAAGCTTITAAAATTT 

ATGCATACGTTCTTCACAAGGGGGTACCCCCAGCAGCCTCTCGAAAATTGCACTTCTCTTAAAA 

CTGTAACTGGCCTTTCTCTTACCTTGCCTTAGGCCTTCTAATCATGAGATCTTGGGGACAAATTG 

ACTATGTCACAGGTTGCTCTCCnTGTAACTCATACCTGTCTGCTTCAGCAACTGCmGCAATGA 

CATTTATITATTAATTCATGCCrTAAAAAAATAGGAAGGGAAGClllIll'ir 1 1 1 iTT 

TrCAATCACACTTTGTGGAAAAACATITCCAGGGACTCAAAATTCCAAAAAGGTGGTCAAATTC 

TGGAAGTAAGCATTTCCTCTTTTTTAAAAATTTGGTTTGAGCCTTATGCCC 
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rrrTTTCTTCTTrCCTTTTTGTTTTTGTGTGGT^ 



GTCGATTGTATGTTTTGAAGGCAAAGTC1 IGOrtJ 1 1 1 lUAUAtiunnui i^^v^--^ 

gScctcctgctgtgcccagtctgagtacct^^ 

ATGTCAACTCCCTGTC 

ccagg^SgScctgccot 
?gg?c^caggSaaggAgaccaggcggtagg 

aIgLca^^^^^ 
ttgggttttx^S 

A^^GGCCAnTGGATTAGTAGCCTTAGCAATGCTACAGGGTTATAGGCCCCT 

a fTr a r nrr A ATG AACTCCATCACTTCTTAAATCGGTATTTGTTAAAAAAATCAGTTATTTTAT 

Stcggg^cS 

gaattcccatctgtitcactgtctccattccataaatctcttcctgtgtgagccaccacacc^g 

C^CTGGGTCTCTCTACTTTTAACACATCTCTCATCC^ 
A^GTGGlTTTAAGAGAAAGCATCAGCTCTGCTrCGTGACAG^ 

gISgIgact^^ 

GGGC^GGGGTATrCCATGTGACTrGTATAGGTATATrrGAGGACAGCATCTTGCTAGAGAAAA 
GGTGAG^G^GTTTITCTTTCTCT 

ATCCCTrGGCCTCCAGAGATTAAACATGGTGCAATGGCACCTCTGTCCAACCTCCTTTCTGGTA 
GASccmCTCCTGCTTCATATAGGCCAAACCTCAGGGCAAGGGAACATGGGGGTAGAGTGGT 

GCTGGOCAGAA<rc 
AmCCTGATCTCTGAGAC 

CTCTCCA^CTCA^CC^ 

rCTTTTAGCTCTATACTCTCTGGCTCCCCTCATCCrCATGGTCACTGAATTAAATGCTTATTGTAT 
TGAGAACCAAGATGGGACCTGAGGACACAAAGATGAGCTC^ 

AGACTCAGGGATTTCACCAGGTCGGTGCAGTATTTGATTTCTGGTGAGGTGACCACAGCTGCAG 
TTAGGAAGGGAGCCATTGAGCACA 

^GTTTGTTTGTTTGTTTGAGACAGGGTCTTGCTCTGCTACCCAGGCTGGGGCGCAATGGCACG 
. . ^nm-n^nnw a AnxnATTrTCPTrfrrArAGCCTCCTGAGGA 




GCTGGGACTACAGGTGCGTGCTACCACGCCCAuC 1 AvJl iuuiai 1 1 1 ^^^™^,V A 
TCACTGTGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCATGATCT 
AGTGCTGGGATTACAAGTGTGAGCCACCACACCTGGCCTGGAAGGAACCTCTTAAA 
ACGTCTTGTATTTTGTTCTGTGATGGAGGACACTGGAGAGAGTTGCTATTCCAGTCAATCATGTC 

GAGTCACTGGACTCTGAAAATCCTATTGGTTCCTTTATTTT^ 

GGTTTGTATTATGTCTGGCAAATGACCTGGGTTATCACT^ 

TTGGAAACTCCTTAGAGAGCATTTTGCTCCTACCAAGGATCAGATACTC 

GATTTCATTTCACTCTAGCCTACATAGAGCTTTCTGTTGCTGTCTCTTGCCATGCM 

TGATTACACACTTGAGAGTACGAGGAGACAAATGACTTACAGATCCCCCGACATrc 

CirGGCAAGCTCAGTTGCCCTGATAGTAGCATGTTTCTGTT^ 

CTTTGCATCAGCCAATTCCCAGAATTTCCCCAGGCAATTTGTAGAGGA 

ATGAGCCATGTCCTCAAAGCTTTTAAACCTCCTTGCTCTCCTACAATATTCAGTACATCA^^ 

GTCATCCTAGAAGGCTTCTGAAAAGAGGGGCAAGAGCCACTCTGCGCCACAAAGGTTGGATCC 

ATCTTCTCTCCGAGGTTGTGAAAGTTITCAAATTGTACTAATAGGCTGGGGCCCTGACTTGGCTG 

TGGGCTTTGGGAGGGGTAAGCTGCTITCTAGATCTCTCCCAGTGAGGCATGGAGGTG 

ATTTTGTCTACCTCACAGGGATGTTGTGAGGCTTGAAAAGGTCAAAAAATGATGGC 

CTCTTTGTAAGAAAGGTAGATGAAATATCGGATGT^^ 

CCCCTGCTCTGTGCAGCAGTCGGGCTGGATGCTCTGTGGCNTTTCTTGGGTCCTCATGC^ 

ACAGCTCCAGGAACCTTGAAGCCAATCTGGGGACTTTCAGATGTTTC 

CAAACTTCCTGCTACACATGCCCTGAATGAATTGCTAAATTT^ 

TAAGGATGTACAAAAGTATGTCTGCATCGATGTCTGTACTGTAAATTTCTAATTTATCACTGTAC 
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AAAGAAAACCCCTTGCTATTTAATTTO 
AA 

Figure 34 

ACCCAGGGACGGAGGACCCAGGCTGGCTTGGGGACTGTCTGCTCTTCTCGGCGGGAGCCGTGG 

AGAGTCCTTTCCCTGGAATCCGAGCCCTAACCGTCTCTCCCCAGCCCTATCCGGCGAGGAGCGG 

AGCGCTGCCAGCGGAGGCAGCGCCTTCCCGAAGCAGTTTATCTTTGGACGGTmCTTTAAAGG 

AAAAACGAACCAACAGGTTGCCAGCCCCGGCGCCACACACGAGACGCCGGAGGGAGAAGCCC 

CGGCCCGGATTCCTCTGCCTGTGTGCGTCCCTCGCGGGCTGCTGGAGGCGAGGGGAGGGAGGG 

GGCGATGGCTCGGCCTGACCCATCCGCGCCGCCCTCGCTGTTGCTGCTGCTCCTGGCGCAGCTG 

GTGGGCCGGGCGGCCGCCGCGTCCAAGGCCCCGGTGTGCCAGGAAATCACGGTGCCCATGTGC 

CGCGGCATCGGCTACAACCTGACGCACATGCCCAACCAGTTCAACCACGACACGCAGGACGAG 

GCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTGGAGATCCAATGCTCGCCGGACCTGCGCT 

TCTTCCTATGCACTATGTACACGCCCATCTGTCTGCCCGACTACCACAAGCCGCTGCCGCCCTGC 

CGCTCGGTGTGCGAGCGCGCCAAGGCCGGCTGCTCGCCGCTGATGCGCCAGTACGGCTTCGCCT 

GGCCCGAGCGCATGAGCTGCGACCGCCTCCCGGTGCTGGGCCGCGACGCCGAGGTCCTCTGCA 

TGGATTACAACCGCAGCGAGGCCACCACGGCGCCCCCCAGGCCTTTCCCAGCCAAGCCCACCCT 

TCCAGGCCCGCCAGGGGCGCCGGCCTCGGGGGGCGAATGCCCCGCTGGGGGCCCGTTCGTGTG 

CAAGTGTCGCGAGCCCTTCGTGCCCATTCTGAAGGAGTCACACCCGCTCTACAACAAGGTGCGG 

ACGGGCCAGGTGCCCAACTGCGCGGTACCCTGCTACCAGCCGTCCTTCAGTGCCGACGAGCGC 

ACGTTCGCCACCTTCTGGATAGGCCTGTGGTCGGTGCTGTGCTTCATCTCCACGTCCACCACAGT 

GGCCACCTTCCTCATCGACATGGACACGTTCCGCTATCCTGAGCGCCCCATCATCTTCCTGTCAG 

CCTGCTACCTGTGCGTGTCGCTGGGCTTCCTGGTGCGTCTGGTCGTGGGCCATGCCAGCGTGGC 

CTGCAGCCGCGAGCACAACCACATCCACTACGAGACCACGGGCCCTGCACTGTGCACCATCGT 

CTTCCTCCTGGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCT 

GGTTCCTGGCCGCCGCGATGAAGTGGGGCAACGAGGCCATCGCGGGCTACGGCCAGTACTTCC 

ACCTGGCTGCGTGGCTCATCCCCAGCGTCAAGTCCATCACGGCACTGGCGCTGAGCTCCGTGGA 

CGGGGACCCAGTGGCCGGCATCTGCTACGTGGGCAACCAGAACCTGAACTCGCTGCGGCGCTT 

CGTGCTGGGCCCGCTGGTGCTCTACCTGCTGGTGGGCACGCTCTTCCTGCTGGCGGGCTTCGTGT 

CGCTCTTCCGCATCCGCAGCGTCATCAAGCAGGGCGGCACCAAGACGGACAAGCTGGAGAAGC 

TCATGATCCGCATCGGCATCTTCACGCTGCTCTACACGGTCCCCGCCAGCATTGTGGTGGCCTG 

CTACCTGTACGAGCAGCACTACCGCGAGAGCTGGGAGGCGGCGCTCACCTGCGCCTGCCCGGG 

CCACGACACCGGCCAGCCGCGCGCCAAGCCCGAGTACTGGGTGCTCATGCTCAAGTACTTCATG 

TGCCTGGTGGTGGGCATCACGTCGGGCGTCTGGATCTGGTCGGGCAAGACGGTGGAGTCGTGG 

CGGCGTTTCACCAGCCGCTGCTGCTGCCGCCCGCGGCGCGGCCACAAGAGCGGGGGCGCCATG 

GCCGCAGGGGACTACCCCGAGGCGAGCGCCGCGCTCACAGGCAGGACCGGGCCGCCGGGCCCC 

GCCGCCACCTACCACAAGCAGGTGTCCCTGTCGCACGTGTAGGAGGCTGCCGCCGAGGGACTC 

GGCCGGAGAGCTGAGGGGAGGGGGGCGTTITGTTTGGTAGTTTTGCCAAGGTCACTTCCGTTTA 

CCTTCATGGTGCTGTTGCCCCCTCCCGCGGCGACTTGGAGAGAGGGAAGAGGGGCGTTTTCGAG 

GAAGAACCTGTCCCAGGTCTTCTCCAAGGGGCCCAGCTCACGTGTATTCTATTTTGCGTTTCTTA 

CCTGCCTTCTTTATGGGAACCCTCTTTTTAATTTATATGTAT 



Figure 35 

GCAGCTCCAGTCCCGGACGCAACCCCGGAGCCGTCTCAGGTCCCTGGGGGGAACGGTGGGTTA 

GACGGGGACGGGAAGGGACAGCGGCCTTCGACCGCCCCCCGAGTAATTGACCCAGGACTCATT 

TTCAGGAAAGCCTGAAAATGAGTAAAATAGTGAAATGAGGAATTTGAACATTTTATCTTTGGAT 

GGGGATCTTCTGAGGATGCAAAGAGTGATTCATCCAAGCCATGTGGTAAAATCAGGAATTTGA 

AGAAAATGGAGATGTTTACATTTITGTTGACGTGTATTTTTCTACCCCTCCTAAGAGGGCACAGT 

CTCTTCACCTGTGAACCAATTACTGTTCCCAGATGTATGAAAATGGCCTACAACATGACGTTTTT 

CCCTAATCTGATGGGTCATTATGACCAGAGTATTGCCGCGGTGGAAATGGAGCATTTTCTTCCT 

CTCGCAAATCTGGAATGTTCACCAAACATTGAAACTTTCCTCTGCAAAGCATTTGTACCAACCT 

GCATAGAACAAATTCATGTGGTTCCACCTTGTCGTAAACnTTGTGAGAAAGTATATTCTGATTG 
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CAAAAAATTAATTGACACTTTTGGGATCCGATGGCCTGAGGAGCTTGAATGTGACAGATTACAA 

TACTGTGATGAGACTGTTCCTGTAACTTTTGATCCACACACAGAATTTCTTGGTCCTCAGAAGA 

AAACAGAACAAGTCCAAAGAGACATTGGATTTTGGTGTCCAAGGCATCTTAAGACTTCTGGGG 

GACAAGGATATAAGTTTCTGGGAATTGACCAGTGTGCGCCTCCATGCCCCAACATGTATTTTAA 

AAGTGATGAGCTAGAGTTTGCAAAAAGTTITATTGGAACAGTTTCAATATTTTGTCm 

ACTCTGTTCACATTCCTTACTTTTTTAATTGATGTTAGAAGATTCAGATACCCAGAGAGACCAAT 

TATATATTACTCTGTCTGTTACAGCATTGTATCTCT 

TAGCACAGCCTGCAATAAGGCAGATGAGAAGCTAGAACTTGGTGACACTGTTGTCCTAGGCTCT 
CAAAATAAGGCTTGCACCGTTTTGTTCATGCTTTTGTATTTTTTC^ 

GTGGGTGATTCTTACCATTACTTGGTTCTTAGCTGCAGGAAGAAAATGGAGTTGTGAAGCCATC 

GAGCAAAAAGCAGTGTGGTTTCATGCTGTTGCATGGGGAACACCAGGTTTCCTGACTGTTATGC 

TTCTTGCTCTGAACAAAGTTGAAGGAGACAACATTAGTGGAGTTTGCTITGTTGGCCTTTATGA 

CCTGGATGCTTCTCGCTACTITGTACTCTTGCCACTGTGCCTTTGTGTGTTTGTTGGGCTCTCTCT 

TCTITrAGCTGGCATTATTTCCTTAAATCATGTTCGACAAGTCATACAACATGATGGCCGGAACC 

AAGAAAAACTAAAGAAATTTATGATTCGAATTGGAGTCTTCAGCGGCTTGTATCTTGTGCCATT 

AGTGACACTTCTCGGATGTTACGTCTATGAGCAAGTGAACAGGATTACCTGGGAGATAACTTGG 

GTCTCTGATCATTGTCGTCAGTACCATATCCCATGTCCTTATCAGGCAAAAGCAAAAGCTCGAC 

CAGAATTGGCmATTTATGATAAAATACCTGATGACATTAATTGTTGGCATCTCTGCTGTCTTC 

TGGGTTGGAAGCAAAAAGACATGCACAGAATGGGCTGGGTTTTrTAAACGAAATCGCAAGAGA 

GATCCAATCAGTGAAAGTCGAAGAGTACTACAGGAATCATGTGAGTTTTTCTTAAAGCACAATT 

CTAAAGTTAAACACAAAAAGAAGCACTATAAACCAAGTTCACACAAGCTGAAGGTCATTTCCA 

AATCCATGGGAACCAGCACAGGAGCTACAGCAAATCATGGCACTTCTGCAGTAGCAATTACTA 

GCCATGATTACCTAGGACAAGAAACTTTGACAGAAATCCAAACCTCACCAGAAACATCAATGA 

GAGAGGTGAAAGCGGACGGAGCTAGCACCCCCAGGTTAAGAGAACAGGACTGTGGTGAACCT 

GCCTCGCCAGCAGCATCCATCTCCAGACTCTCTGGGGAACAGGTCGACGGGAAGGGCCAGGCA 

GGCAGTGTATCTGAAAGTGCGCGGAGTGAAGGAAGGATTAGTCCAAAGAGTGATATTACTGAC 

ACTGGCCTGGCACAGAGCAACAATTTGCAGGTCCCCAGTTCTTCAGAACCAAGCAGCCTCAAA 

GGTTCCACATCTCTGCITGTTCACCCAGTTTCAGGAGTGAGAAAAGAGCAGGGAGGTGGTTGTC 

ATTCAGATACTTGAAGAACATTTTCTCTCGTTACTCAGAAGCAAATTTGTGTTACACTGGAAGT 

gacctatgcactgttttgtaagaatcactgttacgttcttcttitgcacitaaagttgca™ 
tactgttatactggaaaaaatagagttcaagaataatatgactcatttcacacaaaggttaatg 

ACAACAATATAeCTGAAAACAGAAATGTGCAGGTTAATAATATITmTAATAGTGTGGGAGGA 

CAGAGTTAGAGGAATCTTCCTTTTCTATITATGAAGATrCTACTCTTGGTAAGAGTATTrTAAGA 

TGTACTATGCTATITrACCTTTTTGATATAAAATCAAGATATTrCTTTGCTGAAG 

TATCCTTGTATCTTTTTATACATATTTGAAAATAAGCTTATATGTATTTGAACl 1 1 1 iiGAAATCC 

TATTCAAGTATTTTTATCATGCTATTGTGATATTTTAGCACTTTGGTAGCTITrACA 

TAAGAAAATTGTAAAATAGTCTTCTTTTATACTGTAAAAAAAGATATACCAAAAAGTCTTATAA 

TAGGAATTTAACITrAAAAACCCACTrATTGATACCTrACCATCTAAAATGTGTGATTTTTAT^ 

TCTCGTTTTAGGAATTTCACAGATCTAAATrATGTAACTGAAATAAGGTGCTTACTCAAAGAGT 

GTCCACTATTGATTGTATTATGCTGCTCACTGATCCT^ 

AGGGTTAGTAGACAAAATGTTAGTCTTTTGTATATTAGGCCAAGTGCAATTGACTTCCuililll 

AATGTTTCATGACCACCCATTGATTGTATTATAACCACTrACAGTTGCTTATAlllllTGTTTTAA 

CTITrGTTTCTTAACATTTAGAATATrACATTTrGTATTATACAGTACCTT^ 



AG 



Figure 36 



CTCTCCCAACCGCCTCGTCGCACTCCTCAGGCTGAGAGCACCGCTGCACTCGCGGCCGGCGATG 
CGGGACCCCGGCGCGGCCGCTCCGCTTTCGTCCCTGGGCCTCTGTGCCCTGGTGCTGGCGCTGC 
TGGGCGCACTGTCCGCGGGCGCCGGGGCGCAGCCGTACCACGGAGAGAAGGGCATCTCCGTGC 
CGGACCACGGCTTCTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCCTACAACCAGAC 
CATCCTGCCCAACCTGCTGGGCCACACGAACCAAGAGGACGCGGGCCTCGAGGTGCACCAGTT 
CTACCCGCTGGTGAAGGTGCAGTGTTCTCCCGAACTCCGCTTTTTCTTATGCTCCATGTATGCGC 
CCGTGTGCACCGTGCTCGATCAGGCCATCCCGCCGTGTCGTTCTCTGTGCGAGCGCGCCCGCCA 
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GGGCTGCGAGGCGCTCATGAACAAGTTCGGCTTCCAGTGGCCCGAGCGGCTGCGCTGCGAGAA 

CTTCCCGGTGCACGGTGCGGGCGAGATCTGCGTGGGCCAGAACACGTCGGACGGCTCCGGGGG 

CCCAGGCGGCGGCCCCACTGCCTACCCTACCGCGCCCTACCTGCCGGACCTGCCCTTCACCGCG 

CTGCCCCCGGGGGCCTCAGATGGCAGGGGGCGTCCCGCCTTCCCCTTCTCATGCCCCCGTCAGC 

TCAAGGTGCCCCCGTACCTGGGCTACCGCTTCCTGGGTGAGCGCGATTGTGGCGCCCCGTGCGA 

ACCGGGCCGTGCCAACGGCCTGATGTACTTTAAGGAGGAGGAGAGGCGCTTCGCCCGCCTCTG 

GGTGGGCGTGTGGTCCGTGCTGTGCTGCGCCTCGACGCTCTTTACCGTTCTCACCTACCTGGTGG 

ACATGCGGCGCTTCAGCTACCCAGAGCGGCCCATCATCTTCCTGTCGGGCTGCTACTTCATGGT 

GGCCGTGGCGCACGTGGCCGGCTTCCTTCTAGAGGACCGCGCCGTGTGCGTGGAGCGCTTCTCG 

GACGATGGCTACCGCACGGTGGCGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTCTTCATG 

GTGCTCTACTTCCTCGGCATGGCCAGCTCCATCTGGTGGGTCATTCTGTCTCTCACTTGGTTCCT 

GGCGGCCGGCATGAAGTGGGGCCACGAGGCCATCGAGGCCAACTCGCAGTACTTCCACCTGGC 

CGCGTGGGCCGTGCCCGCCGTCAAGACCATCACTATCCTGGCCATGGGCCAGGTAGACGGGGA 

CCTGCTGAGCGGGGTGTGCTACGTTGGCCTCTCCAGTGTGGACGCGCTGCGGGGCTTCGTGCTG 

GCGCCTCTGTrCGTCTACCTCTTCATAGGCACGTCCTTCTTGCTGGCCGGCTrCGTGTCCCTCTTC 

CGTATCCGCACCATCATGAAACACGACGGCACCAAGACCGAGAAGCTGGAGAAGCTCATGGTG 

CGCATCGGCGTCTTCAGCGTGCTCTACACAGTGCCCGCCACCATCGTCCTGGCCTGCTACTTCTA 

CGAGCAGGCCTTCCGCGAGCACTGGGAGCGCACCTGGCTCCTGCAGACGTGCAAGAGCTATGC 

CGTGCCCTGCCCGCCCGGCCACTTCCCGCCCATGAGCCCCGACTTCACCGTCTTCATGATCAAG 

TACCTGATGACCATGATCGTCGGCATCACCACTGGCTTCTGGATCTGGTCGGGCAAGACCCTGC 

AGTCGTGGCGCCGCTTCTACCACAGACTTAGCCACAGCAGCAAGGGGGAGACTGCGGTATGAG 

CCCCGGCCCCTCCCCACCTTTCCCACCCCAGCCCTCTTGCAAGAGGAGAGGCACGGTAGGGAAA 

AGAACTGCTGGGTGGGGGCCTGTTTCTGTAACTTTCTCCCCCTCTACTGAGAAGTGACCTGGAA 

GTGAGAAGTTCTTTGCAGATTTGGGGCGAGGGGTGATTTGGAAAAGAAGACCTGGGTGGAAAG 

CGGTTTGGATGAAAAGATTTCAGGCAAAGACTTGCAGGAAGATGATGATAACGGCGATGTGAA 

TCGTCAAAGGTACGGGCCAGCTTGTGCCTAATAGAAGGTTGAGACCAGCAGAGACTGCTGTGA 

GTTTCTCCCGGCTCCGAGGCTGAACGGGGACTGTGAGCGATCCCCCTGCTGCAGGGCGAGTGGC 

CTGTCCAGACCCCTGTGAGGCCCCGGGAAAGGTACAGCCCTGTCTGCGGTGGCTGCTTTGTTGG 

AAAGAGGGAGGGCCTCCTGCGGTGTGCTTGTCAAGCAGTGGTCAAACCATAATCTCTTTTCACT 

GGGGCCAAACTGGAGCCCAGATGGGTTAATTTCCAGGGTCAGACATTACGGTCTCTCCTCCCCT 

GCCCCCTCCCGCCTGTTTTTCCTCCCGTACTGCTTTCAGGTCTTGTAAAATAAGCATTTGGAAGT 

CTTGGGAGGCCTGCCTGCTAGAATCCTAATGTGAGGATGCAAAAGAAATGATGATAACATTTTG 

AGATAAGGCCAAGGAGACGTGGAGTAGGTATTTTTGCTACTTTTTCATTTTCTGGGGAAGGCAG 

GAGGCAGAAAGACGGGTGTTTTATTTGGTCTAATACCCTGAAAAGAAGTGATGACTrGTTGCrT 

TTCAAAACAGGAATGCATTTTTCCCCTTGTCTITGTTGTAAGAGACAAAAGAGGAAACAAAAGT 

GTCTCCCTGTGGAAAGGCATAACTGTGACGAAAGCAACTTTT ATAGGC AAAGCAGCGCAAATC 

TGAGGTTTCCCGTTGGTTGTTAAlTTGGTTGAGATAAACArrCCTTTTTAAGGAAAAGTGAAGA 

GCAGTGTGCTGTCACACACCGTrAAGCCAGAGGTTCTGACTTCGCTAAAGGAAATGTAAGAGG 

TTTTGTTGTCTGTTTTAAATAAATITAATTCGGAACACATGATCCAACAGACTATG1TAAAATAT 

TCAGGGAAATCTCTCCCTTCATTTACTTTrrCTTGCTATAAGCCTATATTTAGGT^ 

TTTTTTCTCCCATTTGGATCCTTTGAGGTAAAAAAACATAATGTCTT 

AAGTTAATTAAAAAAAAAAAGCAAAGAGCCATTTTGTCCTGTTTTCTTGGTT 

TTATTAAACATCATCCATATGCTGACCCTGTCTCTGTGTGGTTGGGTTGGGAGGCGATCAGCAG 

ATACCATAGTGAACGAAGAGGAAGGTTTGAACCATGGGCCCCATCTTTAAAGAAAGTCATTAA 

AAGAAGGTAAACTTCAAAGTGATTCTGGAGTTCTTTGAAATGTGCTGGAAGACTTAAATTTATT 

AATCTTAAATCATGTACTTTTmCTGTAATAGAACTCGGATTCTTT^ 

TTAGCAGAGAATCATGGGAGCTAACCTTTATCCCACCTTTGACACTACCCTCCAATCTTGCAAC 

ACrATCCTGTTTCTCAGAACAGTTTTTAAATGCCAATCATAGAGGGTACTGTAAAGTGTACAAG 

rrACTTTATATATGTAATGTTCACTTGAGTGGAACTGCTTTTTACATTAAAGTTAAAATCGATCT 

TGTGTTTCTTCAACCTTCAAAACTATCTCATCTGTCAGATTTTTAAAACTCCAACACAGGT^ 

GCATCTTTTGTGCTGTATCITITAAGTGCATGTGAAATTTGTAAAATAGAGATAAGTACAGTAT 

GTATATTTTGTAAATCTCCCATTTTTGTAAGAAA^ 

ATTTITGTTTTGTTGGCTTTAAAGGTCTACCCCACTTTATCACATGTACAGATCACAAATAA^ 
TTTTTAAATAC 
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ACAGCATGGAGTGGGGTTACCTGTTGGAAGTGACCTCGCrGCTGGCCGCCITGGCGCTGCTGCA 

GCGCTCTAGCGGCGCTGCGGCCGCCTCGGCCAAGGAGCTGGCATGCCAAGAGATCACCGTGCC 

GCTGTGTAAGGGCATCGGCTACAACTACACCTACATGCCCAATCAGTTCAACCACGACACGCA 

AGACGAGGCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTGGAGATCCAGTGCTCGCCCGA 

TCTCAAGTTCTTCCTGTGCAGCATGTACACGCCCATCTGCCTAGAGGACTACAAGAAGCCGCTG 

CCGCCCTGCCGCTCGGTGTGCGAGCGCGCCAAGGCCGGCTGCGCGCCGCTCATGCGCCAGTAC 

GGCTTCGCCTGGCCCGACCGCATGCGCTGCGACCGGCTGCCCGAGCAAGGCAACCCTGACACG 

CTGTGCATGGACTACAACCGCACCGACCTAACCACCGCCGCGCCCAGCCCGCCGCGCCGCCTGC 

CGCCGCCGCCGCCCGGCGAGCAGCCGCCTTCGGGCAGCGGCCACGGCCGCCCGCCGGGGGCCA 

GGCCCCCGCACCGCGGAGGCGGCAGGGGCGGTGGCGGCGGGGACGCGGCGGCGCCCCCAGCT 

CGCGGCGGCGGCGGTGGCGGGAAGGCGCGGCCCCCTGGCGGCGGCGCGGCTCCCTGCGAGCCC 

GGGTGCCAGTGCCGCGCGCCTATGGTGAGCGTGTCCAGCGAGCGCCACCCGCTCTACAACCGC 

GTCAAGACAGGCCAGATCGCTAACTGCGCGCTGCCCTGCCACAACCCCTITrTCAGCCAGGACG 

AGCGCGCOTCACCGTCrTCTGGATCGGCCTGTGGTCGGTGCTCTGCTTCGTGTCCACCTTCGCC 

ACCGTCTCCACCTTCCTTATCGACATGGAGCGCTTCAAGTACCCGGAGCGGCCCATTATCTTCCT 

CTCGGCCTGCTACCTCTTCGTGTCGGTGGGCTACCTAGTGCGCCTGGTGGCGGGCCACGAGAAG 

GTGGCGTGCAGCGGTGGCGCGCCGGGCGCGGGGGGCGCTGGGGGCGCGGGCGGCGCGGCGGC 

GGGCGCGGGCGCGGCGGGCGCGGGCGCGGGCGGCCCGGGCGGGCGCGGCGAGTACGAGGAGC 

TGGGCGCGGTGGAGCAGCACGTGCGCTACGAGACCACCGGCCCCGCGCTGTGCACCGTGGTCT 

TCTTGCTGGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTGATCTTGTCGCTCACATGG 

TTCCTGGCGGCCGGTATGAAGTGGGGCAACGAAGCCATCGCCGGCTACTCGCAGTACTTCCACC 

TGGCCGCGTGGCTTGTGCCCAGCGTCAAGTCCATCGCGGTGCTGGCGCTCAGCTCGGTGGACGG 

CGACCCGGTGGCGGGCATCTGCTACGTGGGCAACCAGAGCCTGGACAACCTGCGCGGCTTCGT 

GCrGGCGCCGCTGGTCATCTACCTOTCATCGGCACCATGTTCCTGCTGGCCGGCTTCGTGTCCC 

TGTTCCGCATCCGCTCGGTCATCAAGCAACAGGACGGCCCCACCAAGACGCACAAGCTGGAGA 

AGCTGATGATCCGCCTGGGCCTGTTCACCGTGGTCTACACCGTGCCCGCCGCGGTGGTGGTCGC 

CTGCCTCTTCTACGAGCAGCACAACCGCCCGCGCTGGGAGGCCACGCACAACTGCCCGTGCCTG 

CGGGACCTGCAGCCCGACCAGGCACGCAGGCCCGACTACGCCGTCTTCATGCTCAAGTACTTCA 

TGTGCCTAGTGGTGGGCATCACCTCGGGCGTGTGGGTCTGGTCCGGCAAGACGCTGGAGTCCTG 

GCGCTCCCTGTGCACCCGCTGCTGCTGGGCCAGCAAGGGCGCCGCGGTGGGCGGGGGCGCGGG 

CGCCACGGCCGCGGGGGGTGGCGGCGGGCCGGGGGGCGGCGGCGGCGGGGGACCCGGCGGCG 

GCGGGGGGCCGGGCGGCGGCGGGGGCTCCCTCTACAGCGACGTCAGCACTGGCCTGACGTGGC 

GGTCGGGCACGGCGAGCTCCGTGTCTTATCCAAAGCAGATGCCATTGTCCCAGGTCTGAGCGGA 

GGGGAGGGGGCGCCCAGGAGGGGTGGGGAGGGGGGCGAGGAGACCCAAGTGCAGCGAAGGG 

ACACTTGATGGGCTGAGGTTCCCACCCCTTCACAGTGTTGATTGCTATTAGCATGATAATGAAC 

TCTTAATGGTATCCATTAGCTGGGACTTAAATGACTCACTTAGAACAAAGTACCTGGCATTGAA 

GCCTCCCAGACCCAGCCCCTTTTCCTCCATTGATGTGCGGGGAGCTCCTCCCGCCACGCGTTAAT 

TTCTGTTGGCTGAGGAGGGTGGACTCTGCGGCGTTTCCAGAACCCGAGATTTGGAGCCCTCCCT 

GGCTGCACTTGGCTGGGTTTGCAGTCAGATACACAGATTTCACCTGGGAGAACCTCrrrriCTCC 

CTCGACTCTTCCTACGTAAACTCCCACCCCTGACTTACCCTGGAGGAGGGGTGACCGCCACCTG 

ATGGGATTGCACGGTTTGGGTATTCITAATGACCAGGCAAATGCCrrTAAGTAAACAAACAAGA 

AATGTCITAATTATACACCCCACGTAAATACGGGTTTCTTACATTAGAGGATGTATTTATATAAT 

TATTTGTTAAATTGTAAAAAAAAAAAGTGTAAAATATGTATATATCCAAAGATATAGTGTGTAC 

ATTTTTTTGTAAAAAGTTTAGAGGCTTACCCC^ 

ATAAAATGACITITGATAAATGATTTAACCATTGCCCTCTCCCCCGCCTCTTCTGAGCTGTCACC 

TTTAAAGTGCTTGCTAAGGACGCATGGGGAAAATGGACATTTTCTGGCTTGTCATTCTGTACAC 

TGACCTTAGGCATGGAGAAAATTACTTGTTAAACTCrAGTTCTTAAGTTGTTAGCCAAGTAAAT 

ATCATTGTTGAACTGAAATCAAAATTGAGTTTlTGCACCTrCCCCAAAGACGGTGTTTTTCATGG 

GAGCTCTITrCTGATCCATGGATAACAACrCTCACrTTAGTGGATGTAAATGGAACrTCTGCAA 

GGCAGTAATTCCCCTTAGGCCTTGTTATTTATCCTGCATGGTATCACTAAAGGTTTCAAAACCCT 

GAAAAAAAA 
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CCGCCTTCGGCCCGGGCCTCCCGGGATGGCCGTGGCGCCTCTGCGGGGGGCGCTGCTGCTGTGG 

CAGCTGCTGGCGGCGGGCGGCGCGGCACTGGAGATCGGCCGCTTCGACCCGGAGCGCGGGCGC 

GGGGCTGCGCCGTGCCAGGCGGTGGAGATCCCCATGTGCCGCGGCATCGGCTACAACCTGACC 

CGCATGCCCAACCTGCTGGGCCACACGTCGCAGGGCGAGGCGGCTGCCGAGCTAGCGGAGTTC 

GCGCCGCTGGTGCAGTACGGCTGCCACAGCCACCTGCGCTTCTTCCTGTGCTCGCTCTACGCGC 

CCATGTGCACCGACCAGGTCTCGACGCCCATTCCCGCCTGCCGGCCCATGTGCGAGCAGGCGCG 

CCTGCGCTGCGCGCCCATCATGGAGCAGTTCAACTTCGGCTGGCCGGACTCGCTCGACTGCGCC 

CGGCTGCCCACGCGCAACGACCCGCACGCGCTGTGCATGGAGGCGCCCGAGAACGCCACGGCC 

GGCCCCGCGGAGCCCCACAAGGGCCTGGGCATGCTGCCCGTGGCGCCGCGGCCCGCGCGCCCT 

CCCGGAGACCTGGGCCCGGGCGCGGGCGGCAGTGGCACCTGCGAGAACCCCGAGAAGTTCCAG 

TACGTGGAGAAGAGCCGCTCGTGCGCACCGCGCTGCGGGCCCGGCGTCGAGGTGTTCTGGTCC 

CGGCGCGACAAGGACTTCGCGCTGGTCTGGATGGCCGTGTGGTCGGCGCTGTGCTTCTTCTCCA 

CCGCCTTCACTGTGCTCACCTTCTTGCTGGAGCCCCACCGCTTCCAGTACCCCGAGCGCCCCATC 

ATCTTCCTCTCCATGTGCTACAACGTCTACTCGCTGGCCTTCCTGATCCGTGCGGTGGCCGGAGC 

GCAGAGCGTGGCCTGTGACCAGGAGGCGGGCGCGCTCTACGTGATCCAGGAGGGCCTGGAGAA 

CACGGGCTGCACGCTGGTCrTCCTACTGCTCTACTACTTCGGCATGGCCAGCTCGCTCTGGTGG 

GTGGTCCTGACGCTCACCTGGTTCCTGGCTGCCGGGAAGAAATGGGGCCACGAGGCCATCGAG 

GCCCACGGCAGCTATTTCCACATGGCTGCCTGGGGCCTGCCCGCGCTCAAGACCATCGTCATCC 

TGACCCTGCGCAAGGTGGCGGGTGATGAGCTGACTGGGCTTTGCTACGTGGCCAGCACGGATG 

CAGCAGCGCTCACGGGCTTCGTGCTGGTGCCCCTCTCTGGCTACCTGGTGCTGGGCAGTAGTTT 

CCTCCTGACCGGCTTCGTGGCCCTCTTCCACATCCGCAAGATCATGAAGACGGGCGGCACCAAC 

ACAGAGAAGCTGGAGAAGCTCATGGTCAAGATCGGGGTCTTCTCCATCCTCTACACGGTGCCCG 

CCACCTGCGTCATCGTTTGCTATGTCTACGAACGCCTCAACATGGACTTCTGGCGCCTTCGGGCC 

ACAGAGCAGCCATGCGCAGCGGCCGCGGGGCCCGGAGGCCGGAGGGACTGCTCGCTGCCAGG 

GGGCrCGGTGCCCACCGTGGCGGTCTTCATGCTCAAAATTTTCATGTCACTGGTGGTGGGGATC 

ACCAGCGGCGTCTGGGTGTGGAGCTCCAAGACTTTCCAGACCTGGCAGAGCCTGTGCTACCGCA 

AGATAGCAGCTGGCCGGGCCCGGGCCAAGGCCTGCCGCGCCCCCGGGAGCTACGGACGTGGCA 

CGCACTGCCACTATAAGGCTCCCACCGTGGTCTTGCACATGACTAAGACGGACCCCTCTTTGGA 

GAACCCCACACACCTCTAGCCACACAGGCCTGGCGCGGGGTGGCTGCTGCCCCCTCCTTGCCCT 

CCACGCCCTGCCCCCTGCATCCCCTAGAGACAGCTGACTAGCAGCTGCCCAGCTGTCAAGGTCA 

GGCAAGTGAGCACCGGGGACTGAGGATCAGGGCGGGACCCCGTGAGGCTCATTAGGGGAGAT 

GGGGGTCTCCCCTAATGCGGGGGCTGGACCAGGCTGAGTCCCCACAGGGTCCTAGTGGAGGAT 

GTGGAGGGGCGGGGCAGAGGGGTCCAGCCGGAGTTTATTTAATGATGTAATTTATTGTTGCGTT 

CCTCTGGAAGCTGTGACTGGAATAAACCCCCGCGTGGCACTGCTGATCCTCTCTGGCTGGGAAG 

GGGGAAGGTAGGAGGTGAGGC 



Figure 39 

ACACGTCCAACGCCAGCATGCAGCGCCCGGGCCCCCGCCTGTGGCTGGTCCTGCAGGTGATGG 

GCTCGTGCGCCGCCATCAGCTCCATGGACATGGAGCGCCCGGGCGACGGCAAATGCCAGCCCA 

TCGAGATCCCGATGTGCAAGGACATCGGCTACAACATGACTCGTATGCCCAACCTGATGGGCC 

ACGAGAACCAGCGCGAGGCAGCCATCCAGTTGCACGAGTTCGCGCCGCTGGTGGAGTACGGCT 

GCCACGGCCACCTCCGCTTCTTCCTGTGCTCGCTGTACGCGCCGATGTGCACCGAGCAGGTCTC 

TACCCCCATCCCCGCCTGCCGGGTCATGTGCGAGCAGGCCCGGCTCAAGTGCTCCCCGATTATG 

GAGCAGTTCAACTTCAAGTGGCCCGACTCCCTGGACTGCCGGAAACTCCCCAACAAGAACGAC 

CCCAACTACCTGTGCATGGAGGCGCCCAACAACGGCTCGGACGAGCCCACCCGGGGCTCGGGC 

CTGTTCCCGCCGCTGTTCCGGCGGCAGCGGCCCCACAGCGCGCAGGAGCACCCGCTGAAGGAC 

GGGGGCCCCGGGCGCGGCGGCTGCGACAACCCGGGCAAGTTCCACCACGTGGAGAAGAGCGC 

GTCGTGCGCGCCGCTCTGCACGCCCGGCGTGGACGTGTACTGGAGCCGCGAGGACAAGCGCTT 

CGCAGTGGTCTGGCTGGCCATCTGGGCGGTGCTGTGCTTCTTCTCCAGCGCCTTCACCGTGCTCA 

CCTTCCTCATCGACCCGGCCCGCTTCCGCTACCCCGAGCGCCCCATCATCTTCCTCTCCATGTGC 

TACTGCGTCTACTCCGTGGGCTACCTCATCCGCCTCTTCGCCGGCGCCGAGAGCATCGCCTGCG 

ACCGGGACAGCGGCCAGCTCTATGTCATCCAGGAGGGACTGGAGAGCACCGGCTGCACGCTGG 

TCTTCCTGGTCCTCTACTACTTCGGCATGGCCAGCTCGCTGTGGTGGGTGGTCCTCACGCTCACC 
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TGGTTCCTGGCCGCCGGCAAGAAGTGGGGCCACGAGGCCATCGAAGCCAACAGCAGCTACTTC 

CACCTGGCAGCCTGGGCCATCCCGGCGGTGAAGACCATCCTGATCCTGGTCATGCGCAGGGTG 

GCGGGGGACGAGCTCACCGGGGTCTGCTACGTGGGCAGCATGGACGTCAACGCGCTCACCGGC 

TTCGTGCTCATTCCCCTGGCCTGCTACCTGGTCATCGGCACGTCCTTCATCCTCTCGGGCTTCGT 

GGCCCTGTTCCACATCCGGAGGGTGATGAAGACGGGCGGCGAGAACACGGACAAGCTGGAGA 

AGCTCATGGTGCGTATCGGGCTCTTCTCTGTGCTGTACACCGTGCCGGCCACCTGTGTGATCGCC 

TGCTACTTTTACGAACGCCTCAACATGGATTACTGGAAGATCCTGGCGGCGCAGCACAAGTGCA 

AAATGAACAACCAGACTAAAACGCTGGACTGCCTGATGGCCGCCTCCATCCCCGCCGTGGAGA 

TCITCATGGTGAAGATCTTTATGCTGCTGGTGGTGGGGATCACCAGCGGGATGTGGATTTGGAC 

CTCCAAGACTCTGCAGTCCTGGCAGCAGGTGTGCAGCCGTAGGTTAAAGAAGAAGAGCCGGAG 

AAAACCGGCCAGCGTGATCACCAGCGGTGGGATTTACAAAAAAGCCCAGCATCCCCAGAAAAC 

TCACCACGGGAAATATGAGATCCCTGCCCAGTCGCCCACCTGCGTGTGAACAGGGCTGGAGGG 

AAGGGCACAGGGGCGCCCGGAGCTAAGATGTGGTGCTTTTCTTGGTTGTGTTTTTCTTTCTTOT 

CTrCTITITTTTTITITrATAAAAGCAAAAGAGAAATACAT 

AGGATGCTGTGATACACTGAAAGGAAAAATGTACTTAAAGGGTTTTC 

AGCGAAGGGAAGCTCCTCCAGTGAAGTAGCCTCTTGTGTAACTAATTTGTGGTAAAGTAGTTGA 

TTCAGCCCTCAGAAGAAAACTTTTGTTTAGAGCCCTCCGTAAATATACATCTGTGTATTTGAGTT 

GGCTTTGCTACCCATTTACAAATAAGAGGACAGATAACTGCTTTGCAAATTCAAGAGCCTCCCC 

TGGGTTAACAAATGAGCCATCCCCAGGGCCCACCCCCAGGAAGGCCACAGTGCTGGGCGGCAT 

CCCTGCAGAGGAAAGACAGGACCCGGGGCCCGCCTCACACCCCAGTGGATTTGGAGTTGCTTA 

AAATAGACTCTGGCCTTCACCAATAGTCTCTCTGCAAGACAGAAACCTCCATCAAACCTCACAT 

ITGTGAACTCAAACGATGTGCAATACATTTTTTTCTCTTTCCTTGAAAATAAA^ 

GTATTTTGCTATATATAAAGACAACAAAAGAAATCTCCTAACAAAAGAACTAAGAGGCCCAGC 

CCTCAGAAACCCTTCAGTGCTACATTTTGTGGCTTTITAATGGAAACCAAGCCAATGTTATAGA 

CGTTTGGACTGATTTGTGGAAAGGAGGGGGGAAGAGGGAGAAGGATCATTCAAAAGTTACCCA 

AAGGGCTTATTGACTCTTTCTATTGTTAAACAAATGATTTCCACAAACAGATCAGGAAGCACTA 

GGTTGGCAGAGACACTTTGTCTAGTGTATTCTCTTCACAGTGCCAGGAAAGAGTGGTTTCTGCG 

TGTGTATATTTGTAATATATGATATTTTC 

TTAAAAAAA 

Figure 40 

CCTGCAGCCTCCGGAGTCAGTGCCGCGCGCCCGCCGCCCCGCGCCTTCCTGCTCGCCGCACCTC 

CGGGAGCCGGGGCGCACCCAGCCCGCAGCGCCGCCTCCCCGCCCGCGCCGCCTCCGACCGCAG 

GCCGAGGGCCGCCACTGGCCGGGGGGACCGGGCAGCAGCTTGCGGCCGCGGAGCCGGGCAAC 

GCTGGGGACTGCGCCTTTTGTCCCCGGAGGTCCCTGGAAGTTTGCGGCAGGACGCGCGCGGGG 

AGGCGGCGGAGGCAGCCCCGACGTCGCGGAGAACAGGGCGCAGAGCCGGCATGGGCATCGGG 

CGCAGCGAGGGGGGCCGCCGCGGGGCCCTGGGCGTGCTGCTGGCGCTGGGCGCGGCGCTTCTG 

GCCGTGGGCTCGGCCAGCGAGTACGACTACGTGAGCTTCCAGTCGGACATCGGCCCGTACCAG 

AGCGGGCGCTTCTACACCAAGCCACCTCAGTGCGTGGACATCCCCGCGGACCTGCGGCTGTGCC 

ACAACGTGGGCTACAAGAAGATGGTGCTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGG 

TGAAGCAGCAGGCCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGGACCCAGG 

TCTTCCTCTGCTCGCTCTTCGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTC 

TGCGAGGCCGTGCGCGACTCGTGCGAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGCCCGAGA 

TGCTTAAGTGTGACAAGTTCCCGGAGGGGGACGTCTGCATCGCCATGACGCCGCCCAATGCCAC 

CGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCCTCCCTGTGACAACGAGTTGAAATCTGA 

GGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGCACTGAGGATGAAAATAAAAGAAGTGAA 

AAAAGAAAATGGCGACAAGAAGATTGTCCCCAAGAAGAAGAAGCCCCTGAAGTTGGGGCCCA 

TCAAGAAGAAGGACCTGAAGAAGCTTGTGCTGTACCTGAAGAATGGGGCTGACTGTCCCTGCC 

ACCAGCTGGACAACCTCAGCCACCACTTCCTCATCATGGGCCGCAAGGTGAAGAGCCAGTACTT 

GCTGACGGCCATCCACAAGTGGGACAAGAAAAACAAGGAGTTCAAAAACTTCATGAAGAAAA 

TGAAAAACCATGAGTGCCCCACCTTTCAGTCCGTGTTTAAGTGATTCTCCCGGGGGCAGGGTGG 

GGAGGGAGCCTCGGGTGGGGTGGGAGCGGGGGGGACAGTGCCCGGGAACCCGTGGTCACACA 

CACGCACTGCCCTGTCAGTAGTGGACATTGTAATCCAGTCGGCTTGTTCTTGCAGCATTCCC 
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CCCTTTCCCTCCATAGCCACGCTCCAAACCCCAGGGTAGCCATGGCCGGGTAAAGCAAGGGCC 
ATTTAGATTAGGAAGGTTTTTAAGATC 

TGACAAACCATTTCCAACAGCAACACAGCCACTAAAACACAAAAAGGGGGATTGGGCGGA^ 
GTGAGAGCCAGCAGCAAAAACTACATTTTGCAACTTGTTGGTGTGGATCTATTGGCTGATCT 
GCCTTTCAACTAGAAAATTCTAATGATTGGCAAGTCACGTTGTTTTC^ 
TTCTGTCTGCTTTAAATC 

TGATAAGTGCAGGGAGGAAAAGTGCAAGTCCATTATCTAATAGTGACAGCAAAGGGACCAGGG 
GAGAGGCATTGCCTTCTCTGCCCACAGTC^ 

TCTCAGATGCCCCAAAGTTTCGGTTCCTATGAGCCCGGGGCATGATCTGATCCCCAAGACATGT 

GGAGGGGCAGCCTGTGCCTGCCTTTGTGTCAGAAAAAGG 

CGGCGATTTTCGGGCTGAGAAGGCAGTAGTTTTCAAAACACATAGTTA 



Figure 41 

GAATTCGTTCAGCCTGGTTAAGTCCAAGCT 

GAGCTGCGCGCGGGCTTGCAGCGCCTCGCCCGCGCTGTCCTCCCGGTGTCCCGCTTCTCCGCGC 

CCCAGCCGCCGGCTGCCAGCTTTTCGGGGCCCCGAGTCGCACCCAGCGAAGAGAGCGGGCCCG 

GGACAAGCTCGAACTCCGGCCGCCTCGCCCTTAACCAGCTCCGTCCCTCTACCCCCT 

GCGCCCACGATGCTGCAGGGCCCTGGCTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTC 

GGGCTCGGCGCGCGGGCTCTTCCTCTTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAA^ 

AAGCCCATCCCGGCCAACCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCC 

AACCTGCTGGGCCACGAGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTG 

GTCATGAAGCAGTGCCACCCGGACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCC 

TCGATGACCTAGACGAGACCATCCAGCCATGCCACTCTCGNTGCGTGCAGGTGAAGGATCGCT 

GCGCCCCGGTCATGTCCGCCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGA 

CAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCA 

AAGGTATGTGAAGCCTGCAAAAATAAAAATGATGATC^ 

AAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCAACCGT 



Figure 42 

CCGGGTCGGAGCCCCCCGGAGCTGCGCGCGGGCTTGCAGCGCCTCGCCCGCGCTGTCCTCCCGGTGTCCC 
GCTTCTCCGCGCCCCAGCCGCCGGCTGCCAGCTTTTCGGGGCCCCGAGTCGCACCCAGCGAAGAGAGCGG 
GCCCGGGACAAGCTCGAACTCCGGCCGCCTCGCCCTTCCCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTC 
GCGCGCCCACGATGCTGCAGGGCCCTGGCTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGG 
CTCGGCGCGCGGGCTCTTCCTCTTTGGCCAGCCCGACTTCTCCTACAAGCGCAGC^TTGCAAGCCCATC 
CCTGCCAACCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACG 
AGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCGGA 
CACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGACCATCCAGCCA 
TGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCGCCTTCGGCTTCCCCTGGC 
CCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCA 
CCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATGTGAAGCCTGCAAAAATAAAAATGATGATGACAAC 
GACATAATGGT^AACGCTTTGTAAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCA 
ACCGAGATACCAAAATCATCCTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAG 
GGACCTGAAGAAATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATC 
AACGCGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGGTGGC 
AGAAGGGGCJVGAGAGAGTTC^GCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTAGTCCCGGCATCC 
TGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCCGGGATCTCAGCTCCCGTTCC 
CCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCTTCCCCCTGCCTTTTGCACGTTTGCATCC 
C CAGCATTT C CTGAGTTATAAGGC CACAGGAGTGGATAGCTGTTTTCACCTAAAGGAAAAGCCCAC C CGA 
ATCTTGTAGAAATATTCAAACTAATAAAATCATGAATATTTTTATGAAGTTT 



WO 03/012082 



PCT/GB02/03409 



Figure 43 36/41 

ACGGGGCCTGGGCGGSAGGGGCGGTGGCTGGAGCTCGGTAAAGCTCGTGGGACCCCATTGGGG 

GAATTTGATCCAAGGAAGCGGTGATTGCCGGGGGAGGAGAAGCTCCCAGATCCTTGTGTCCAC 

TTGCAGCGGGGGAGGCGGAGACGCGGAGCGGGCCTTTTGGCGTCCACTGCGCGGCTGCACCCT 

GCCCCATCCTGCCGGGATCATGGTCTGCGGCAGCCCGGGAGGGATGCTGCTGCTGCGGGCCGG 

GCTGCTTGCCCTGGCTGCTCTCTGCCTGCTCCGGGTGCCCGGGGCTCGGGCTGCAGCCTGTGAG 

CCCGTCCGCATCCCCCTGTGCAAGTCCCTGCCCTGGAACATGACTAAGATGCCCAACCACCTGC 

ACCACAGCACTCAGGCCAACGCCATCCTGGCCATCGAGCAGTTCGAAGGTCTGCTGGGCACCC 

ACTGCAGCCCCGATCTGCTCTTCTTCCTCTGTGCCATGTACGCGCCCATCTGCACCATTGACTTC 

CAGCACGAGCCCATCAACCCCTGTAAGTCTGTGTGCGAGCGGGCCCGGCAGGGCTGTGAGCCC 

ATACTCATCAAGTACCGCCACTCGTGGCCGGAGAACCTGGCCTGCGAGGAGCTGCCAGTGTAC 

GACAGGGGCGTGTGCATCTCTCCCGAGGCCATCGTTACTGCGGACGGAGCTGATTTTCCTATGG 

ATTCTAGTAACGGAAACTGTAGAGGGGCAAGCAGTGAACGCTGTAAATGTAAGCCTATTAGAG 

CTACACAGAAGACCTATTTCCGGAACAATTACAACTATGTCATTCGGGCTAAAGTTAAAGAGAT 

AAAGACTAAGTGCCATGATGTGACTGCAGTAGTGGAGGTGAAGGAGATTCTAAAGTCCTCTCT 

GGTAAACATTCCACGGGACACTGTCAACCTCTATACCAGCTCTGGCTGCCTCTGCCCTCCACTT 

AATGTTAATGAGGAATATATCATCATGGGCTATGAAGATGAGGAACGTTCCAGATTACTCTTGG 

TGGAAGGCTCTATAGCTGAGAAGTGGAAGGATCGACTCGGTAAAAAAGTTAAGCGCTGGGATA 

TGAAGCTTCGTCATCTTGGACTCAGTAAAAGTGATTCTAGCAATAGTGATTCCACTCAGAGTCA 

GAAGTCTGGCAGGAACTCGAACCCCCGGCAAGCACGCAACTAAATCCCGAAATACAAAAAGTA 

ACACAGTGGACTTCCTATTAAGACTTACTTGCATTGCTGGACTAGCAAAGGAAAATTGCACTAT 

TGCACATCATATTCTATTGTTTACTATAAAAATC 

TITGGTTTCTGCTTCTCTCTTCTCTCAACCCCTITGTAATGGTITGGGGGCAGA 

TTGTGAGTTTTCTATTTCACTAATCATGAGAAAAACTGTTCTTTTGCA^ 

ATGCTGTTA 

Figure 44 

CAGCGGCCGCTGAATTCTAGGGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGCTCGTG 

CCCTGTGTGCCAGACGGCGGAGCTCCGCGGCCGGACCCCGCGGCCCCGCTTTGCTGCCGACTGG 

AGTTTGGGGGAAGAAACTCTCCTGCGCCCCAGAAGATTTCTTCCTCGGCGAAGGGACAGCGAA 

AGATGAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCCGGGGTCGCAGCGCGAGAGGGCA 

GTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGTGGCTGCACCTGGCGCTGGGCGTGCG 

CGGCGCGCCCTGCGAGGCGGTGCGCATCCCTATGTGCCGGCACATGCCCTGGAACATCACGCG 

GATGCCCAACCACCTGCACCACAGCACGCAGGAGAACGCCATCCTGGCCATCGAGCAGTACGA 

GGAGCTGGTGGACGTGAACTGCAGCGCCGTGCTGCGCTTCTTCTTCTGTGCCATGTACGCGCCC 

ATTTGCACCCTGGAGTTCCTGCACGACCCTATCAAGCCGTGCAAGTCGGTGTGCCAACGCGCGC 

GCGACGACTGCGAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCG 

ACGAGCTGCCTGTCTATGACCGTGGCGTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCC 

GGAGGATGTTAAGTGGATAGACATCACACCAGACATGATGGTACAGGAAAGGCCTCTTGATGT 

TGACTGTAAACGCCTAAGCCCCGATCGGTGCAAGTGTAAAAAGGTGAAGCCAACTTTGGCAAC 

GTATCTCAGCAAAAACTACAGCTATGTTATTCATGCCAAAATAAAAGCTGTGCAGAGGAGTGG 

CTGCAATGAGGTCACAACGGTGGTGGATGTAAAAGAGATCTTCAAGTCCTCATCACCCATCCCT 

CGAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGTGTCCACACATCCTGCCCCATCAAG 

ATGTrCTCATCATGTGTTACGAGTGGCGTTCAAGGATGATGCTTCTTGAAAATTGCTTAGTTGAA 

AAATGGAGAGATCAGCTTAGTAAAAGATCCATACAGTGGGAAGAGAGGCTGCAGGAACAGCG 

GAGAACAGTTCAGGACAAGAAGAAAACAGCCGGGCGCACCAGTCGTAGTAATCCCCCCAAACC 

AAAGGGAAAGCCTCCTGCTCCCAAACCAGCCAGTCCCAAGAAGAACATTAAAACTAGGAGTGC 

CCAGAAGAGAACAAACCCGAAAAGAGTGTGAGCTAACTAGTTTCCAAAGCGGAGACTTCCGAC 

TTCCTTACAGGATGAGGCTGGGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCC 

CTAACAACTCACTGCAGTGCTCTTCATAGACACATCTTGCAGCATTTTTCTTAAGGCTATGCTTC 

AGTTTTTCTTTGTAAGCCATCACAAGCCATAGTGGTAGGTTTGCCCrrTGGTACAGAAGGTGAG 

TTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCAGAGTAACCTGTGTGCATACTCTAGAAG 

AGTAGGGAAAATAATGCTrGTTACAATTCGACCTAATATGTGCATTGTAAAATAAATGCCATAT 

TrCAAACAAAACACGTAATTTTTTTACAGTATGTTTTATTACCTTTTGATATCTGT^ 

GTTAGTGATGTTTTAAAATGTGATGAAAATATAATGTTTTTAAGAAGGAACAGTAGTGGAATGA 

ATGTTAAAAGATCTTTATGTGTTTATGGTCT 
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GAAAAATTAGAGAAGTAGCATATGGAAAAT^ 

TGTTTTrAGCTAGAAACTTAAAAACAAAAATAATAATAAAGAAAAATAAATAAAAAGGAGAGG 

CAGACAATGTCTGGATTCCTGTTTTTTGGTTACCTGATTTCCATGATCATGATGCTTCTTGTCAA 

CACCCTCTTAAGCAGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATT 

AGTTGGCTAATGCTCAAGTATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTCCCAG 

GACATCCACCCTGAGAATAATTTGACAAGCTTAAAAATGGCCTTCATGTGAGTGCCAAATTTTG 

TTTTrCTTCATTTAAATATTTTCTTTGC 

GAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAATTACTTGACAGTTGGGATACTTTAATCAG 
AAAAAAAGAACTTATITGCAGCATTTTATCAACAAATTTCATAATTGTGGACAATTGGAGGCAT 
TTATTTTAAAAAACAATTTTATTGGCCT^ 

CAATAAATGCACAACGCCCAAAGGAAATAAAATCCTATCTAATCCTACTCTCCACTACACAGA 

GGTAATCACTATTAGTATTTTGGCATATTATTCTCCAGGTGTTTGCTTATGCACTTATAAAATGA 

TITGAACAAATAAAACTAGGAACCTGTATACATGTGTTTCATAACCTGCCTCCTTTGCTTGGCCC 

TITATTGAGATAAGTTITCCTGTCAAGAAAGCAGAAACCATCTCATTTCTAACAGCTGTGTTATA 

TTCCATAGTATGCATTACTCAACAAACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGA 

CAATACTGAATAAACATCTCACCGGAATTC 

Figure 45 

AAGCTTGATATCGAATTCGCGGCCGCGTCGACGGGAGGCGCCAGGATCAGTCGGGGCACCCGC 

AGCGCAGGCTGCCACCCACCTGGGCGACCTCCGCGGCGGCGGCGGCGGCGGCTGGGTAGAGTC 

AGGGCCGGGGGCGCACGCCGGAACACCTGGGCCGCCGGGCACCGAGCGTCGGGGGGCTGCGC 

GGCGCGACCCTGGAGAGGGCGCAGCCGATGCGGGCGGCGGCGGCGGCGGGGGGCGTGCGGAC 

GGCCGCGCTGGCGCTGCTGCTGGGGGCGCTGCACTGGGCGCCGGCGCGCTGCGAGGAGTACGA 

CTACTATGGCTGGCAGGCCGAGCCGCTGCACGGCCGCTCCTACTCCAAGCCGCCGCAGTGCCTT 

GACATCCCTGCCGACCTGCCGCTCTGCCACACGGTGGGCTACAAGCGCATGCGGCTGCCCAA.ee 

TGCTGGAGCACGAGAGCCTGGCCGAAGTGAAGCAGCAGGCGAGCAGCTGGCTGCCGCTGCTGG 

CCAAGCGCTGCCACTCGGATACGCAGGTCTTCCTGTGCTCGCTCTTTGCGCCCGTCTGTCTCGAC 

CGGCCCATCTACCCGTGCCGCTCGCTGTGCGAGGCCGTGCGCGCCGGCTGCGCGCCGCTCATGG 

AGGCCTACGGCTTCCCCTGGCCTGAGATGCTGCACTGCCACAAGTTCCCCCTGGACAACGACCT 

CTGCATCGCCGTGCAGTTCGGGCACCTGCCCGCCACCGCGCCTCCAGTGACCAAGATCTGCGCC 

CAGTGTGAGATGGAGCACAGTGCTGACGGCCTCATGGAGCAGATGTGCTCCAGTGACTTTGTG 

GTCAAAATGCGCATCAAGGAGATCAAGATAGAGAATGGGGACCGGAAGCTGATTGGAGCCCA 

GAAAAAGAAGAAGCTGCTCAAGCCGGGCCCCCTGAAGCGCAAGGACACCAAGCGGCTGGTGC 

TGCACATGAAGAATGGCGCGGGCTGCCCCTGCCCACAGCTGGACAGCCTGGCGGGCAGCTTCC 

TG<jTCATGGGCCGCAAAGTGGATGGACAGCTGCTGCTCATGGCCGTCTACCGCTGGGACAAGA 

AGAATAAGGAGATGAAGTTTGCAGTCAAATTCATGTTCTCCTACCCCTGCTCCCTCTACTACCCT 

TTCTTCTACGGGGCGGCAGAGCCCCACTGAAGGGCACTCCTCCTTGCCCTGCCAGCT 

GCTTGCCCTCTGGCCCCGCCCCAACTTCCAGGCTGACCCGGCCCTACTGGAGGGTGTTTTCACG 

AATGTTGTTACTGGCACAAGGCCTAAGGGATGGGCACGGAGCCCAGGCTGTCCTTTTTGACCCA 

GGGGTCCTGGGGTCCCTGGGATGTTGGGCTTCCTCTCTCAGGAGCAGGGCTTCITCATCTGGGT 

GAAGACCTCAGGGTCTCAGAAAGTAGGCAGGGGAGGAGAGGGTAAGGGAAAGGTGGAGGGGC 

TCAGGGCACCCTGAGGCGGAGGTTTCAGAGTAGAAGGTGATGTCAGCTCCAGCTCCCCTCTGTC 

GGTGGTGGGGCCTCACCTTGAAGAGGGAAGTCTCAATATTAGGCTAAGCTATTTGGGAAAGTTC 

TCCCCACCGCCCCTGTACGCGTCATCCTAGCCCCCCTTAGGAAAGGAGTTAGGGTCTCAGTGCC 

TCCAGCCACACCCCCTGCCTTCCCCAGCTTGCCCATTTCCCTGCCCCAAGGCCCAGAGCTCCCCC 

CAGACTGGAGAGCAAGCCCAGCCCAGCCTCGGCATAGACCCCCTTCTGGTCCGCCCGTGGCTCG 

ATTCCCGGGATTCATTCCTCAGCCTCTGCTTCTCCCTTTTATCCCAAT 

TGAGGCCATAGGTACTAGACAACCAATACATGCAGGGTTGGGTTTTCTAAl 1 1 1 ill AACTTTTT 
AATTAAATCAAAGGTCGACGCGCGGCCGCGGAATTCCTGCAGCCCGGGGGATCCCCGGGTACC 

GAGCTCGAATTC 
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ATGCATCTCCTCTTATTTCAGCTGCT 

ATGGCCGCCAGAATCAGAGTTCTCTTTCCCCCGTACTCCTGCCAAGGAATCAAAGAGAGCTTCC 

CACAGGCAACCATGAGGAAGCTGAGGAGAAGCCAGATCTGTTTGTCGCAGTGCCACACCTTGT 

AGCCACCAGCCCTGCAGGGGAAGGCCAGAGGCAGAGAGAGAAGATGCTGTCCAGATTTGGCA 

GGTTCTGGAAGAAGCCTGAGAGAGAAATGCATCCATCCAGGGACTCAGATAGTGAGCCCTTCC 

CACCTGGGACCCAGTCCCTCATCCAGCCGATAGATGGAATGAAAATGGAGAAATCTCCTCTTCG 

GGAAGAAGCCAAGAAATTCTGGCACCACTTCATGTTCAGAAAAACTCCGGCTTCTCAGGGGGT 

CATCTTGCCCATCAAAAGCCATGAAGTACATTGGGAGACCTGCAGGACAGTGCCCTTCAGCCA 

GACTATAACCCACGAAGGCTGTGAAAAAGTAGTTGTTCAGAACAACCTTTGCTTTGGGAAATGC 

GGGTCTGTTCATTTTCCTGGAGCCGCGCAGCACTCCCATACCTCCTGCTCTCACTGTTTGCCTGC 

CAAGTTCACCACGATGCACTTGCCACTGAACTGCACTGAACTTTCCTCCGTGATCAAGGTGGTG 

ATGCTGGTGGAGGAGTGCCAGTGCAAGGTGAAGACGGAGCATGAAGATGGACACATCCTACAT 

GCTGGCrCCCAGGATTCCTTTATCCCAGGAGTTTCAGCTTGA 



Figure 47 

CGGCACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATT^ 

TGAGTCCTTCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGT 

AGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAAT 

TCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTGCA 

GTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACAACTAC 

CAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACC 

CGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATG 

CGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAA 

ATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTT 

GGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGG 

TTCTGTTrGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCA 

AGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTC 

ATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGA 

TCACCATCAAGCCAGTAATTOTCTAGGCnTCACA(^GTCAGAGACACrAAACCAGCTATCCA 

AATGCAGTGAACTCCTTlTATATAATAGATGCTATGAAAACCTTrrATGACCTTCATCAACT^ 

TCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCATTCCAATAACACCITCCAAAAACCTG 

GAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTGTGATTC 

TTCTCAGTGTGGCACTTACCTGTAAATGCAATGAAACT^ 

CTGCCTATTTTTCCTCTTGTTATGTAAATTTTTGTACACATTGATTGTTATCTTG 

TTCTATATTGAACTGAAGTAAATCATTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCA 

TTTAATTCTAGAGTCTAGAACGCAAGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGT 

AACATGAAAATACTAGCTTATTTTCTGAAATGTACTATOT 

AGGCTGTGATAGTTTTTGAAATAAAATTTAACATTTAATATCATGAAATGTTATAAGTAGACAT 



Figure 48 

GCGGGTCTCGCITGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCT 

GGGTCCCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGG 

GGCCCCGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCACCCCTCCGGGCTTTGCGTCTTCCTGGG 

GACCCCCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGGATTCGTCCTGCTGCCTGCTCCT 

ACTGGCCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGTTCGCGGGCCAAACTCAACTC 

CATCAAGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCGGGCATGTAC 

CAAGGACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGC 

AGTGATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGC 

ATGGTGTGTCGGAGAAAAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGC 

TGCAATAATGGCATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGG 

ATGGTACTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGA 

ATCTAGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGCCTAC 

GATCATCAGACTGCATTGAAGGGTITrGCTGTGCTCGTCATTTCTGGACCAAAATCTGCAAACC 
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AGTGCTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCTCATGGGCTGGAAAT 

TITCCAGCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCC 

TCCAAAGCCAGACTCCATGTGTGTCAGAAAATTTGATCACCATTGAGGAACATCATCAATTGCA 

GACTGTGAAGTTGTGTATTTAATGCATTATAGCATGGTGGAAAATAAGGTTCAGATGCAGAAG 

AATGGCTAAAATAAGAAACGTGATAAGAATATAGATGATCAC 



Figure 49 

CTATCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGA 

AATTCACAAGATAACCAACAACCAGACTGGACAAATGGTCTTTTCAGAGACAGTTATCACATCT 

GTGGGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAG 

CATGTACTGCCAGTTTGCCAGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTC 

TGCACCCGGGACAGTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATG 

GCCACCAGGGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGC 

TGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTTT 

GCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGAGCCTT 

GGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGCCCCACAGCCACAGCCTGGTGTATGTG 

TGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTC 

CCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGAG 

AGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTGCCGCCGCTGCACTGCTGGGA 

GGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAATAGAAATAGCTAATTTAT 

TTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTrCCTACATCTTCTrCCCAGTAAG 

TTTCCCCTCTGGCnTGACAGCATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCA 

GGCrTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACC 

CCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCT 

TTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTTGGGG 

AAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATGCAACAAATGAATT 

TTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCAGATGAAATGTTC 

TGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCCTACCTCTGTGCCA 

GGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTCATTG 

TTCTCCTCGTCCATCAGGGATCTCAGAGGNCTCAGAGACTGCAAGCTGCTTGCCCAAGTCACAC 

AGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCTCTCCACTAC 

CCCACACCAGCCITGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGATITITCTTT 

TGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAACTACT 

GTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGATATTG 

ACACTGTCCCTCTITGGCAGTrGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATAC 

AGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGCA 

AAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGG 

CTGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGA 

TGTTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTrAAAGTTTAAAGTTG 

CACATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCAT^ 

AAATCAAGCATAAATCAC 



Figure 50 

AGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGAGCAGCCTCGCrrTT 

GGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAGGATGGTGGCGGCCGTCCT 

GCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGGTCCTGGACTTCAACAACATCAGG 

AGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTCACAGTGCCTGTCTGACACGGACTGCAAT 

ACCAGAAAGTTCTGCCTCCAGCCCCGCGATGAGAAGCCGTTCTGTGCTACATGTCGTGGGTTGC 

GGAGGAGGTGCCAGCGAGATGCCATGTGCTGCCCTGGGACACTCTGTGTGAACGATGTTTGTAC 

TACGATGGAAGATGCAACCCCAATATTAGAAAGGCAGCTTGATGAGCAAGATGGCACACATGC 

AGAAGGAACAACTGGGCACCCAGTCCAGGAAAACCAACCCAAAAGGAAGCCAAGTATTAAGA 

AATCACAAGGCAGGAAGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACTGTGGCCCTG 
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GACTTTGCTGTGCTCGTCATTTTTC 
CTGCTCCAGAAGAGGGCATAAAGACACTGCTCAAGCTCC 
GGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAA^ 
GCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAG 



Figure 51 

AGGCAGAATACTTCTATGAATTCCTGTCCTTGCGCTCCCTGGATAAAG 
AACCGTCAATGTCCCTCTGCTGGGAACAGTGCC 
CCATGTCTTGGAAAACAGGATGGGGTGGCAGCATTTC 
AAGGCAACACCATTCTCCAAACACCTCAAAATGCTATCTTCTTTAA^ 

GTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAATGAAAGACGCATCTGCGAGTGTCCTGA 
TGGGTTCCACGGACCTCACTGTGAGAAAGCCCTTTGTACCCCACGATGTATGAATGGTGGACTT 
TGTGTGACTCCTGGTTTCTGCATCTGCCCACCTGGATTCTATGGAGTGAACTGTGACAAAGCAA 
ACTGCTCAACCACCTGCTTTAATGGAGGGACCT^ 

GGACTAGAGGGAGAGCAGTGTGAAATCAGCAAATGCCCACAACCCTGTCGAAATGGAGGTAA 

ATGCATTGGTAAAAGCAAATGTAAGTGTTCCAAAGGTTACCAGGGAGACCTCTGTTCAAAGCCT 

GTCTGCGAGCCTGGCTGTGGTGCACATGGAACCTGCCATGAACCCAACAAATGCCAATGTCAA 

GAAGGTTGGCATGGAAGACACTGCAATAAAAGGTACGAAGCCAGCCTCATACATGCCCTGAGC 

GCAGCAGCGCCCAGCTCAGGCAGCACACGCCITCACTTAAAAAGGCCGAGGAGCGGCGGCATC 

CACCTGAATCCAATTACATCTGGTGAACTCCGACATCTGAAACGTTTTAAGTTAC 

ATAGCCTTTGTTAACCTTTCATGTGTTGAATGTTCAAATAATGTTCA 

GCCTGAATTTTATTAGCTTCATTATAAATCACTGAGCTGATATTTACT 

AAGTACGTCTGTAGCATGATGGTATAGATTTTCTTGTTTCAGTGCTT^ 

ATGTCAATTGATCAGGTTAAAATTTTCAGTGTGTAGTTGGCAGATATTTT 

ATTTATGGTGTCTGGGGGCAGGGGAACATCAGAAAGGTTAAATTGGGCAAAAATGCGTAAGTC 

ACAAGAATTTGGATGGTGCAGTTAATGTTGA^ 

TTAGATGTTTGTTACATTTTTAAAAATTGCTCT^ 

CCTTACCATTATTCCAGAGATTCAGTATTAAAAAAAAAAAAATTACACTG 
AAACAATATAATATATTCTA^ 

GCTTGAAGCAATATAATATATTGTAAACAAAACACAGCTCTTACCTAATAAAC 
TTTGTATGTATAAAATAAAGGTGCTGCTTTAGTTTTC 



Figure 52 

ATGGGCATCGGGCGCAGCGAGGGGGGCCGCCGCGGGGCAGCCCTGGGCGTGCTGCTGGCGCTGGGCGCGG 
CGCTTCTGGCCGTGGGCTCGGCCAGCGAGTACGACTACGTGAGCTTCCAGTCGGACATCGGCCCGTACCA 
GAGCGGGCGCTTCTACACCAAGCCACCTCAGTGCGTGGACATCCCCGCGGACCTGCGGCTGTGCCACAAC 
GTGGGCTACAAGAAGATGGTGCTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCAGG 
CCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGCACCCAGGTCTTCCTCTGCTCGCTCTT 
CGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTCTGCGAGGCCGTGCGCGACTCGTGC 
GAGCCGGT CATGCAGTT CTTCGG CTT CTACTGGCCCGAGATG CTTAAGTGTGACAAGTTC C C CGAGGGGG 
ACGTCTGCATCGCCATGACGCCGCCC^TGCCACCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCC 
TCCCTGTGACAACGAGTTGAAATCTGAGGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGGGCTGAGT 
TTAAAGATGATTGTGGGTAGCTCCCATAACTCATGCTGCACGCTGGGTCCTTCTCATCCCAACTCCTCAA 
AGCGGCAGGAGCAGGAACTGGGGACTCCTGAGAGAAGGCTTGGATATGGCCTTTTATTACACTTCATCCA 
AGGAAATCTGCCCCCACCCTGTGCCCAGGCCCGATCACGCATGAGGCTAAAGACGGAGGCCACTCCGCTG 
GCTCTGGGTAGATCTGCCCCTGGACTGTTTGCCGACTGCCCGGAGCGCCCTCTGCCGGTCTGCAGCTTCC 
CACACCACACGGAAGAAGTGGGGAAACTGAGGATACATTCTTTCCTCCTCCAGGTAAAGGGATTCTCAAT 
GAAGGGCTTGTGTGCACCTTCCACACTTAGATACCTCTACTACCTGAAAACCAGCATGCAGCATGTACAT 
CAAGAGTAC CAGGCACATAGTGCTCAAGT CTGGG CTAATATG CCACCTGCAGAGAGATGTAAAGATGAAG 
AAGACAAAGCCATGTTTTCAAAGTGA 
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GGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGCTCGTGCCCTGTGTGCCAGACGGCGGAGCTCCG 
CGGCCGGACCCCGCGGCCCCGCTTTGCTGCCGACTGGAGTTTGGGGGAAGAAACTCTCCTGCGCCCCAGA 
AGATTTCTTCCTCGGCGAAGGGACAGCGAAAGATGAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCC 
GGGGTCGCAGCGCGAGAGGGCAGTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGTGGCTGCACC 
TGGCGCTGGGCGTGCGCGGCGCGCCCTGCGAGGCGGTGCGCATCCCTATGTGCCGGCACATGCCCTGGAA 
CATCACGCGGATGCCCAACCACCTGCACCACAGCACGCAGGAGAACGCCATCCTGGCCATCGAGCAGTAC 
GAGGAGCTGGTGGACGTGAACTGCAGCGCCGTGCTGCGCTTCTTCTTCTGTGCCATGTACGCGCCCATTT 
GCACCCTGGAGTTCCTGCACGACCCTATCAAGCCGTGCAAGTCGGTGTGCCAACGCGCGCGCGACGACTG 
CGAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCGACGAGCTGCCTGTCTAT 
GACCGTGGCGTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCCGGAGGATGTTAAGTGGATAGACA 
TCACACCAGACATGATGGTACAGGAAAGGCCTCTTGATGTTGACTGTAAACGCCTAAGCCCCGATCGGTG 
CAAGTGTAAAAAGGTGAAGCCAACTTTGGCAACGTATCTCAGCAAAAACTACAGCTATGTTATTCATGCC 
AAAATAAAAGCTGTGCAGAGGAGTGGCTGCAATGAGGT CACAACGGTGGTGGATGTAAAAGAGATCTT C A 
AGTCCTCATCACCCATCCCTCGAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGTGTCCACACAT 
CCTGCCCCATCAAGATGTTCTCATCATGTGTTACGAGTGGCGTTCAAGGATGATGCTTCTTGAAAATTGC 
TTAGTTGAAAAATGGAGAGAT CAGCTTAGTAAAAGATCCATACAGTGGGAAGAGAGGCTGCAGGAACAG C 
GGAGAACAGTTCAGGAGAAGAAGAAAACAGCCGGGCGCACCAGTC 

AAAGCCTCCTGCTCC CAAACCAGCCAGTCC CAAGAAGAACATTAAAACTAGGAGTGCC CAGAAGAGAACA 
AACCCGAAAAGAGTGTGAGCTAACTAGTTTCCAAAGCGGAGACTTCCGACTTCCTTACAGGATGAGGCTG 
GGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCCCTAACAACTCACTGCAGTGCTCTTCA 
TAGACACATCTTGCAGCATTTTTCTTAAGGCTATGCTTCAGTTTTTCTTTGTAAGCCATCACAAGCCATA 
GTGGTAGGTTTGCCCTTTGGTACAGAAGGTGAGTTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCA 
GAGTAACCTGTGTGCATACTCTAGAAGAGTAGGGAAAATAATGCTTGTTACAATTCGACCTAATATGTGC 
ATTGTAAAATAAATGC CATATTT CAAACAAAACACGTAATTTTTTTACAGTATGTTTTATTACCTTTTGA 
TATCTGTTGTTGCAATGTTAGTGATGTTTTAAAATGTGATGAAAATATAATGTTTTTAAGAAGGAACAGT 
AGTGGAATGAATGTTAAAAGATCTTTATGTGTTTATGGTCTGCAGAAGGATTTTTGTGATGAAAGGGGAT 
TTTTTGAAAAATTAGAGAAGTAGCATATGGAAAATTATAATGTGTTTTTTTACCAATGACTTCAGTTTCT 
GTTTTTAGCTAGAAACTTAAAAACAAAAATAATAATAAAGAAAAATAAATAAAAAGGAGAGGCAGACAAT 
GTCTGGATTCCTGTTTTTTGGTTACCTGATTTCCATGATCATGATGCTTCTTGTCAACACCCTCTTAAGC 
AGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATTAGTTGGCTAATGCTCAAGT 
ATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTCCCAGGACATCCACCCTGAGAATAATTTGA 
CAAGCTTAAAAATGGCCTTCATGTGAGTGCCAAATTTTGTTTTTCTTCATTTAAATATTTTCTTTGCCTA 
AATACATGTGAGAGGAGTTAAATATAAATGTACAGAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAAT 
TACTTGACAGTTGGGATACTTTAATCAGAAAAAAAGAACTTATTTGCAGCATTTTATCAAC^AATTTCAT 
AATTGTGGACAATTGGAGGCATTTATTTTAAAAAACAATTTTATTGGCCTTTTGCTAACACAGTAAGCAT 
GTATTTTATAAGGCATTCAATAAATGCACAACGCCCAAAGGAAATAAAATCCTATCTAATCCTACTCTCC 
ACTACACAGAGGTAATCACTATTAGTATTTTGGCATATTATTCTCCAGGTGTTTGCTTATGCACTTATAA 
AATGATTTGAACAAATAAAACTAGGAACCTGTATACATGTGTTTCATAACCTGCCTCCTTTGCTTGGCCC 
T TTATTGAGATAAGTTTTCCTGTCAAGAAAGCAGAAAC CATCT CATTTCTAACAGCTGTGTTATATT CCA 
TAGTATGCATTACTCAACAAACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGACAATACTGAA 
TAAACATCTCACCGGAATTC 

Figure 54 

GAGGCGCCTTGGGACCGCGTGGGAGCCGCAGCCGAACCGAGTAGGGACCGGGACCGCGCGGCGCCGCCG 

TCCCCGGCCGGGCCCGGCCCCCGCGAGCCGAGCGCGCGCCCCCGTCGCCCACCCGGGCGCGGCTGGATGC 

GGCGGGGTCCCCGCGGCGGCGACCCCCGGCCCCGAGCGCCCGGAGCGCCCAGAGGCGGCGTGCGGGGCC 

CGGGGACGCCGCGCCCTSTBGTGCGCCGAGGCGCGCCCCGAGACAGCCGGGGGCCCGCGCCGCAGCCGC 

CGCCCGCGCTGAGCCCCGGCCCGGCCCGCGGCCCGCGCCCGGCGGCAGCNTGAGCCAGGCCGAGCTGTC 

CACCTGCTCCGCGCCGCAGACGCAGCGCATCTTCCAGGAGGCTGTGCGCNAGGGCAACACGCAGGAGCT 

GCAGTYGCTGCTGCAGAACATGACCAACrGCGAGTTCAACGTGAACTCGTTCGGGCCCGAGGGCCAGAC 

GGCGCTGCACCAGTCGGTCATCGTCGGCAACCTGGTGCTCGTGAAGCTGCTGGTCAAGTTCGGCGCCGAC 

ATCCGCCTGGCCAACCGCGACGGCTGGAGCGCGCTGCAMATCGCCGCGTTCGGTGGCCACCAGGACATC 

GTGCTCTATCTCATCACCAAGGCGAAGTACGCGGCCAGCGCSGGTGTATGCCCGCCGGGACCCCGGACCC 

CGGCCCTGCGCCCGCGTCGTCTCTGCTGTACCTTCCCGCCAACTACCTCGGTGCGCGCMCGGCTCGCAGG 

CCCCGCCAGAAGGCCCGTGGCAACGGCGAATACGGCGCGTGCGTCMCGGCCCCAGGGTC 



